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PREFACE 


This book is founded on tlie lecture course given in the 
University of St Andrews to students reading for an 
Ordinary Degree in Mathematics and to first-year Honours 
students. It contains a short and elementary account of 
algebraic equations, both from the theoretical and the 
practical side, together with the algebra of poljniomials 
and rational fractions. It includes classes of number, 
partitions, identities, the G.C.M. process, partial fractions 
and recurring series, but it omits continued fractions and 
indeterminate equations. Cubic and biquadratic equations 
are discussed, together with more general types, elimina- 
tion, and symmetric fimctions, but the theory of invariants 
and of groups is left untouched. 

An elementary knowledge of algebra is presupposed, 
particularly of long division, quadratic equations and the 
binomial theorem, together with elementary determinants, 
coordinate geometry and differential calculus. The more 
advanced deterniinantal and matrical theory of p. 48 and 
p. 141 may be omitted On a first reading. 

The book has been written witli the liistorical develop- 
ment of algebra constantly in mind ; and many of the 
topics have been selected for their im])ortancc, not only 
as part of a general mathematical education, but also as 
preliminaries to the study of all higher algebra. 

I acknowledge with gratitude the luilj) which I have 
derived from the well-known treatises on algebra and 
equations written in England, Ireland, America and 
Germany ; more particularly from the works of Todhunt(w 
and of Professor P. P». Fischer. I also thank the (iditors of 


the present seih's, my colleagues, and the printers for their 


helpful co-operation. 


H. W. TUKNBULL. 
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PREFACE TO THE SECOND EDITION 

The text of the first edition has been revised, and I am 
grateful to friends who have pointed out to me various 
errors, which have been corrected in the present edition. 
A new chapter is added on general methods of root 
expansion, which links the early work of Newton with 
recent discoveries of the Edinburgh School of Algebraists, 
and affords an introduction to the study of bialternants, 

H. W. T. 
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CHAPTBE I 


INTEGERS, MORE GENERAL TYPES OF NUMBER. 
POLYNOMIALS 

1. Integers and Partitions. The concepts of positive 
integers and of their partitions are fundamental in algebra. 
We may imagine the natural numbers 1, 2, 3, ... to be 
arranged in a sequence, as here, according to ascending 
order of magnitude. The term of the sequence is the 
positive integer n. For every integer n there is an integer 
w+1 immediately following ; so that the complete sequence 
of natural numbers has no last term. We may express 
this by saying that no positive integer is infinite, or in 
symbols by writing 0<n< oo. 

If r, s, ..., z are Ic positive integers such that 

n = r+STp ... +3 . . . (1) 

then k must be some number between 1 and n inclusive, 
so that If n is fixed we may regard (1) as an 

equation to determine the unknown.s r, s, ..., z. Usually 
there are several possible solutions and the equation is 
said to be indeterminate ; but if = 1 there is oiio 
solution r = n, and if ^ = n there is again one solution, 
r = s == ... = z — 1. The set {r, s, z) is called a 
partition of n, and it is customary to arrange the parts, 
or terms, r, s, ..., z in ascending, or else in descending, 
order. For example {112} is a partition of 4 into 1+1 +2, 
and {346} of 13 into 3+4+6. Derangement of the terms 
is immaterial, so that we should reckon 1 +2 1 , 2 + 1 + 1 

or 1 + 1+2 to be the same parlUion. It is ca.sy to verify 
that there are five distinct partitions when n — 4 : 

{1 1 1 1}, {1 1 2}, {1 3}, {2 2}, {+). . (2) 

A 
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These are sometimes represented graphically by 
X 

X X /o\ 

XXX XX ■ ^ ' 

X, XX, XXX, XX, xxxx, 

where rows of crosses take the place of parts or terms in 
the partition. In general the top row has r crosses, the 
second, s, and the last, z. We may naturally enquire 
how many distinct partitions of n exist : and if we denote 
this number hjp{n), we can verify that j9(l) = 1, p{2) = 2, 
p{Z) = 3, p(4^) = 5, p{5) = 7, but to find an expression 
for the general value p{n) proves to be a singularly difficult 
undertaking, which has taxed the skill of the greatest 
mathematicians. 

The five partitions of = 4, arranged as above, are 
said to be in lexical order. This manner of arrange- 
ment becomes obvious when the integers 1, 2, 3, ..., 
are replaced by the respective letters a, h, c, ..., and 
each partition is regarded as a “ word,” for then the 
words fall into alphabetical, or lexical, order : aaaa, aab, 
ac, bb, d. Such an order is possible and unique for each 
value of n. 

Conjugate Partitions. The partitions {112} and {13} 
are said to be conjugate. Conjugacy is a mutual relation 
which becomes intuitive when the graphs of crosses are 
examined : the columns of one graph, read from left to 
right, have the same numbers of entries as the rows of 
the conjugate graph read from below to above. Conjugate 
graphs are therefore reflexions of each other in a line 
inclined at 45° to the rows or columns. Every partition 
possesses a conjugate or else is self-conjiogate : {1111} and 
{4} are conjugate, {22} is self- conjugate. 

Example. — Obtain the partitions of n — 5, n “ 0, and 
arrange them in conjugate pairs, and self-conjugate singles. 

2. Rational and Irrational Numbers. In a more 
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general view, integers are either positive, negative or 
zero, and can be arranged in ascending order . . . —3, —2, 

— 1, 0, 1, 2, 3, .... If n is any such integer then 

— oo<K<co. If p is any integer and q is any positive 
integer, the number p-^q (or pjq) is a rational number, a 
concept which leads at once to the theory of fractions, 
proper and improper, of factors, multiples, prime numbers 
and pairs of numbers prime to one another (that is, which 
have no common factor other than unity), with all 
of which terms it wiU be supposed that the reader 
is famihar. By expressing a rational number in decimal 
form we may find that it either terminates or recurs : 
for example 241/100 = 2*41 which terminates, while 
12/11 = 1-090909... = 1-09 recurs. Conversely, every 
terminating or recurring decimal is reducible to a rational 
number of the form pjq, and when p and q are prime to 
each other the reduction is unique. But we may also 
contemplate decimals which neither terminate nor recur, 
such as 0-1234567891011... (which is composed of the 
digits of the positive integers in natural order), or 
1-4142... = ^/2. These represent irrational numbers. 
We include under the name real both rational and irrational 
numbers. As it is impossible to make a complete list 
in ascending order of all the consecutive rationals (much 
less of irrationals) between two integers, it is natural to 
invoke the help of geometry and to represent these 
numbers by points on a straiglit line. Each real number x 
is then represented by a point N upon an unlimited axis 
X'OX, and x is called the coordinate of N referred to an 
origin 0 : and the number x is the distance ON in terms 
of a given unit distance. 

3. Polynomials, Equations, Complex Numbers. 

We may consider numbers from another point of view 
as the roots of equations. If « is a positive integer the 
expression 

j{x) ^a^x^ H-Uoa;"- 24 -... . ( 1 ) 
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is called a polynomial of order or degree n in the variable x 
(briefly, an n-ic in x), whose coefiicients a^, a^, are 

constants, of which is non-zero (ao^i^O). The relation 
f{x) = 0 is then called an equation of order or degree n. 
It is a major problem of mathematics to determine the 
roots of this equation, that is, to find values of x for which 
the equation is true. The attempt to solve the equation — 
to find one or more such roots — ^has led to the concept of 
a new type of number, distinguishable from the real, and 
called a complex number. 

Let us write out the equation systematically for low 
values of n, using a, b, c for the coefficients : 

ax~\-b — 0, 

ax'^+bx+c = 0 , 

ax^+bx^-\~cx-{-d = 0, ‘ * • W 

ax^-\-bx^-\-cx^-{-dx-\-e = 0, 


and so on. These are called tlie linear, quadratic, cubic, 
quartic (or biquadratic), eqxiation respectively. First 
let all the coefficients be integers : then the linear equation 
is soluble in rationals, namely x = —b/a. Conversely 
every rational number is the root of a linear equation 
with integer coefficients ; for example 2-41 is the root of 
lOOr = 241. In particular if a = 1 the root of such an 
equation is an integer. Rational roots may also exist 
for higher equations, as in 2 r 2 _ 3 a;_j_l _ q, of which 
a: = 1 is a root, or in a:3_3a:_i8 = o, of which r = 3 is 
a root ; but in general the roots of non-linear equations 
are irrational. Even with integer coefficients a, 6, c the 
quadratic equation ax'^-^bx-\-c = 0 cannot always be 
solved unless complex numbers are iirtroduced. For the 
elementary method gives the formal solution 


X — 


-6j^V(6a-4ac) 

2a 


, a^^O, 


( 3 ) 


which is only possible in terms of real numbers when 
6^— 4ac^0. If b^—4ac is a perfect square p^, >0, there 
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are two solutions x = —{b±:p)j2a which are both rational. 
If 6 2 — is positive but not a perfect square there are 
two irrational solutions. If 6^ — 4ac = 0 there is one 
solution ; it is rational, and the quadratic is said to have 
a repeated root. But if 6®— 4ac is negative there is no 
real solution : and at one time in the history of mathematics 
such an equation was rejected as impossible. However, 
the more enterprising of mathematical pioneers — Cardan, 
Napier, Wallis, Leibniz and Gauss — boldly went forward, 
asserting the existence of a new type of number, which 
Napier called the ghost of a real number but which 
nowadays is called the complex number. The formula (3) 
gives two complex values of x whenever — 4ac<0, that 
is when 4ac— 6^ is positive. It is usual to reduce x to 
the form 

X • • • (4^) 

where a = — &/2a and ^ = -\/(4«c— 6^), so that both 
a and ^ are real, but i is not real, although it satisfies 
the quadratic equation with real coefficients, 

*2+1 = 0 (5) 

Here we have made two important assumptions : 

(i) that equation (5) has a root i (which can formally 

be written y'— 1), 

(ii) that this root * is a number which combines with 

the real numbers according to the ordinary laws 
of algebra. 

Example. — The equation 2a;®+a;+3 = 0 has complex roots 

1 

— -+* ; SO also has 2x®+6a:+5 = 0, tlio roots of wliich 

4 4 

are — li+i'*. 

4. Geometrical Treatment of Complex Numbers. 

There are several ways of justifying the above a,ssumption.H 
regarding complex numbers, but perhaps the most 
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attractive, and on first consideration the most convincing 
method, is that of Gauss (1797), who represented a complex 
number by a point (a, yS) referred to rectangular 

Cartesian axes Oa:, Oy in the familiar way. 



To each point P of a plane there correspond a number 
pair (a, y8) and a complex number a+iy8. There is thus 
a one-to one correspondence between the finite points of 
the plane and the finite complex numbers. The plane is 
called the complex number plane, the Gauss plane or, some- 
tim(3S, the Argarid diagram. 

In polar coordinates {r, d) we have a — r cos d = ON, 
= r sin ^ NP and therefore 

aA-i^ = r(cos d-\-i sin 6). . . (1) 

We may always avoid any ambiguity by assuming that 

r > 0, — TT < (9 < 7T. . . . (2) 

With these provisos, which attach to each complex number 
one value of r and one of 6, we call r the modulus and d 
the amplitude of the number z — a-\-if^ represented by the 
point P. The notation \z\ is used for the modulus. Thus 

N| = \o.+i^\ = r ~ \a~i^\. . . ( 3 ) 

Also tan d = ^ja or am g = arc tan y8/a = tan-^ ^/a. . (4) 

Real numbers are represented by points on the axis 
of a: or by number pairs of the type (a, 0), where a can be 



COMPLEX NUMBERS 


positive, negative or zero. Pure imaginary numbers 
are represented by points on the axis of y, except the 
origin (0, 0), or by number pairs of the type (0, fi), where 
^>0 or jS<0. For this reason the line y — 0 is sometimes 
called the real axis and the line a; = 0 the imaginary axis. 



The symbol i is to be regarded and defined as an 
operation upon real numbers which changes the real pair 
{a, j8) on which it operates to ( — a), or geometrically, 
the operator i is defined as that which turns the line OP 
in a positive sense to OQ through a right angle. In the 
figure of a circle PQRS, ignore the points ABOD for the 
moment, and suppose the radius r to take any positive value. 

From this definition everything follows. Repeat the 
operation and OP is turned to OR through two right 
angles : three operations Hi turn OP through throe, and 
four operations iiii through four right anglc.s. If z = a-]~i^ 
then the four expressions iz, Hz = Hz, iiiz =: Hz, HHz 
= Hz denote the four complex numbers corresponding to 
Q, R, S, P, the vertices of a square centred at 0, for all 
non-zero values of z. Wo note tliat i(H) — {ii)i = Hi. 

In particular if 2 = I and F is at A, then i, H, H ai-e 
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written short for il, i^l, and correspond to the points 
B, 0, D respectively. But C has coordinates (—1, 0) ; 
hence = —1. This links the present definition of i 
with the previous assumption, that i denoted the root of 
the equation ” 0- 

Next we can show that jSi mean the same number. 
For both correspond to the point (0, j8) — either by talcing 
a length OP = ^ along Ox and turning it through a right 
angle, or by taking a unit length along 0.r, tinning it 
through a right angle to get i and then measuring off ^ 
such units along Oy. 

Addition is defined by the identity 

a+i^+y+iS = {a-{-y)+i{^+S). . (5) 

If P is (a, and R is (y, 8) and OPQR is a parallelogram, 
then Q denotes the complex number so obtained by 
addition. Geometrically the points (0, 0), (a, j8), (y, S), 
(a+y, j8-j-S) form a parallelogram. 



Subtraction follows at once as the inverse of addition. 
If we write, for short, OP-j-OR = OQ, then OQ— OP = OR. 
Or, again, replace y and 8 by —y and —8 in the above, 
and we have 
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Multiplication is defined by the identity 

(a+ijS)(y+iS) = ay—^S-hi{aS-tp^^ 

which is readily formed by the ordinary rules ,-'alg‘ebfi,;. 
and the relation = —1, but is geometrically equivalent' 
to the following rule : 

Let the corresponding polar coordinates be {r, 6} and 
{5, Then 

{r, 0} {s, <^} = {rs, 0-i-c^}, . . . (8) 

that is, multiply the moduli and add the amplitudes to get 
the modulus and the amplitude of the product. If 
or < — 7T, the amplitude is of course the equivalent angle 
between 

To prove that this rule is equivalent to the identity (7) 
we take, as is usual, 

a ~ r cos 0, ^ — r sin 0, y ~ s cos 0, S = 5 sin <56 (9) 

and thus we can write (a, j8) = (r, 0} = r (cos 0-l~i sin 0) 
to denote the same complex number. Hence 

(a-i-i^)(y+iS) 

=r(Gos 0-i-i sin ^)s(cos sin 0) 

= rs(cos 0 cos sin 0 sin 0+j(cos 0 sin ^+sin 0 cos <56)) 

= rs{cos {0+<f>)+i sin {0-{-^ )), 

or (a, /3)(y, S) = {rs, which is what we wished to 

prove. The i-ulo given above for forming the product of 
two complex numbers is in fact a geometrical statement 
of Demoivre’s theorem. 

Division is defined as the converse of multiplication, 
namely, 

{r,0}-Hs,<f>}^{r/s,0-<f>}, s^-0, . . (10) 

for, by the earlier rule, 

{r/s, 0-<j,} {.S’, <!>} = 

Manifestly (r, 0} {.s’, r/j) = {.s, <f>} {r, 0}. 


]-ir. 


0 }. 
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It is well to notice that this rule of multiplication incor- 
porates the earlier geometrical definition of i. Instances of 
(7) are 

{ 2 , 0 } { 1 , l-a} = { 2 , i-TT}, 

{3, hr} {2, -I-tt} = {6, 0}, {1, In} {1, -M = {1, n}, 
that is to say, 2xi = 12 , 3ix( — 2i) = &, ixi = — 1. 

5. Complex Numbers as Roots of Quadratic 
Equations. The number a+i/S is a general complex 
number, separated into real and imaginary parts a, 
respectively. Sometimes ^ is referred to, rather loosely, 
as the imaginary part of the complex number. We 
note that every such complex number is a root of 
a quadratic all of whose coefficients are real : for if 
X = a +4^ then (a;— a)^ = = i^^^ = — ^8®. Hence 

x^~2ax-i-a^-\-^^ = 0, which is a real quadratic equation. 
Similarly, a—i^ is a root. We call these two roots con- 
jugate complex roots, and we say that aiijS form a pair of 
conjugate complex numbers. 

The sum of two conjugate complex numbers is always 
real, the difference (whenever is a pure imaginary. 

We have introduced complex numbers by going no 
further than quadratic equations whose coefficients are 
integers. The questions naturally arise, what happens if 
we allow the coefficients themselves to be fractional, 
irrational or even complex, and again if we consider 
equations of higher degree ? It is one of the dramatic 
surprises of mathematics to find that no further generaliza- 
tion is necessary, in that any such equation can be 
completely solved in terms of complex or simpler numbers. 
This fact, which is called the fundamental theorem of 
algebra, was first proved by Gauss (c/. p. 56). At present 
we shall assume it, and shall illustrate its scope by two 
examples. 

1. To solve the complox linear equation ax-\-h = 0, 
whore a and b arc complex. Let a = a+ip, a = a—ifi ; then 
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calculate x = —baji For instance if (2 + 3i)x-l-4 — 5^ = 0, 
we have 

5?:-4 (5i:-4)(2-3i) 7 + 22i _ 7 22^. 

2 + 3i ~ (2 + 3i)(2^ai) 4-9^^ ~ 

The point of the process is to render the denominator real 
and thus to separate the real and imaginary ];)arts of x. We 
realize the denominator of a complex fraction by a method 
analogous to that of rationalizing the denominator of a surd. 

2. Every polynomial whose coefficients are complex can 
be regarded as a factor of a polynomial whose coefficients are 
real. For example: — Arrange this as 

(2a;® — dx)+{x^ — 2a; + l)i and multiply it by the conjugate 
polynomial (2x‘^ — Zx)~{x^—2x+l)i, obtained by changing 
i to ~i. The result is 

( 2a:® - 3a;)® + (a:® - 2x + 1 )®, 
or 5.^'* — 1 6a;® + 1 — 4a; + 1, which is real. 

6. Algebraic Numbers and Integers. Within the 
scope of real and complex numbers further demarcations 
can be made. If and each coefficient of the poly- 

nomial in 3 (1) is an integer, then each root, real or complex, 
of the equation f{x) = 0 is called an algebraic number : 
conversely any such number must be a root of such an 
equation for a finite value of n. In particular if ofo = 
each root is called an algebraic integer : for instance i is 
an algebraic integer. So also is ^/3, for it is a root of 
X®— 3 = 0 ; and again the complex surd co ~ ( — l-f-i-\/3)/2 
is an algebraic integer, for it is a root of the equation 
m^-f-co + l == 0. The numbers are sometimes called 
the imaginary units, for they have properties resembling 
those of the real units ±1. In fact if a: = or 
then any integral power of x is equal to one or other of 
these four units. 

Algebraic numbers may bo rational or irrational, but 
it does not foUow that all iiTationals are algebraic : such 
as are not so are called transcendental, well-known cxam])l(.‘3 
of wliich are tt = 3-1415926535... and e = 2-7182818284.... 
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Neither of these is the root of an equation of finite degree, 
whose coefficients are integers. The first attempt to prove 
the transcendence of tt and e was made in 1667 by James 
Gregory, but the first strict proof was given by Lindemann 
as recently as 1882. 

Note . — The aritlimetical fovmdations of the theory of 
rational and irrational numbers is due to Dedekind and Cantor. 
For a full treatment of real and complex numbers the reader 
is referred to G. H. Hardy, Pure Mathematics (Cambridge, 
1938), pp. 1-33. 

7. Rational Functions. By an obvious extension of 
ideas we classify functions of a variable x in much the 
same way as numbers, into integral, rational, irrational, 
algebraic and transcendental functions. The result of 
combining x with itself and with constant numbers by 
addition, subtraction and multiplication in any order, 
with or without repetitions, but only with a finite number 
of such elementary operations, is called a rational integral 
function. It is in fact a polynomial, which can alwaj^s 
be arranged in descending powders of x. If division is 
also admitted a finite number of times the function is 
called a rational, but it may be fractional, function. 
Practice with these processes will have convinced the 
reader that any such function can be reduced to one or 
other of the forms 

f{x)!fy{x), f{x), c, 

where both f{x) and are polynomials in x, and c is a 
constant, which may possibly be zero. 

Examples. 

1 a:® — ! 

(i) (“) T’ {x-\-a){x—a) — {x+b){x—b). 

i+! 

X 

(i) Hero/(a;) = .a-^+Sa-, = x+2 ; (ii) f{x) = x + 1 ; 

(iii) c =: h^—a^. 
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Examples of pol 5 momials or rational integral functions : 

2x-\-d, x^, ax^+bx+c, — 3, {px—qY-\~Tx, 2x^ + 3x^ — l, 
where n and p are positive integers. 

Rational fractional functions ; 

1 a+bx 2—3x-\-‘ix^ 1—x”' 1 2 ~ ^ , 

, — — , , x-^—2x-^+x. 

X c-^dx 1 — a;®” x — 1 x — 5 

Irrational fxmctions : ^/x, ^(x^—2), (1+ ^x)f(l —x). 

Transcendental functions : e®, sin x, log x, sin~^a;. 

The above examples are t 3 rpical but not exhaustive. 
The functions also maybe either algebraic or transcendental, 
so that we include under algebraic both rational and 
irrational cases. The feature of an algebraic function 
y — ijj{x) is that it is always possible to express the relation 
between x and y rationally as a polynomial involving 
both variables, say /(a:, y) = 0. For example, 

ify= (1+ ®/a:)/(l-a;), then {y{l—x)—l)^—x= 0. 

This is called a sextic equation in x and y, since the term 
of the highest degree is x^y^. It is impossible to express 
a transcendental function by such an equation of finite 
degree. 

Further examples. Discuss the nature of the following 
functions: ■\/(x^-\-2x-j-l), -\/(ir“ + l), (1— a?) sin“a3/(l — cos^a;), 
{x^ + 2x)lx. [Rational, irrational, polynomial, polynomial.] 

8. Tabular and Graphical Representation of a 
Polynomial. If f{x) is a given real polynomial of degree 
n, and if y — f{x), then it is always possible to represent 
the function f{x) both graphically and by means of a 
table. We take particular values Xj, x^, x^, ... of x and 
calculate the corresponding values y^, y^, y^, ... of y. 
Since f{x) is rational, integral and real, wo obtain one real 
value of y for each x. Had f{x) been a rational fraction 
this would not be the case, for there would be no value 
of y corresponding to such exceptional values of x as cause 
a denominator to vanish. The results can then be 
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tabulated either by rows or by columns in the familiar 
way, and when this is done the table gives a representation 
of the function. While it is a suggestive and useful 
representation, yet it is only approximate, for the values 
of y are usually decimalized and accurate only to a limited 
number of decimal places, and in any ease a complete 
set of values in any interval whatever is out of the question. 
It is useful to arrange the values of x from left to right, 
or else columnwise, in aseendii'g order. 


Example. y =/(.^') — — 


X 

-10 


-3 

—2 

-1 

0 

1 

2 

3 

4 

5 

6 


10 

y 

-1056 


~20 

0 

6 

4 

0 

0 

10 

36 

84 

160 

... 

864 



A 

B 

c 

D 

E 

F 

G 

H 





However full the table is, it is always possible to 
increase it, either by extension or by interpolation. By 
the latter is meant inserting new values of x between 
adjacent values of x already found, yielding the correspond- 
ing y. What, in this example, are the values of y within 
the interval l<a:<2 ? For each such value of x it will 
be found that y is negative. 

Tables such as these were famih'ar to the mathematicians 
of the sixteenth and early seventeenth centuries, who 
developed great skill in devising methods for elaborating 
them : their aim was to render tables of trigoiioinetrical 
and logarithmic functions as complete as possible. But 
a great advance in insight was gained in 1638 when 
Descartes published his method of coordinates, wIu.Ttiby 
any such table could be tlirown into geometrical form by 
plotting points A, B, C, ... whose coordinates were x 
and y, one such point for each pair of entries. Those 
points are distinct and separated : they form a dis- 
continuous graphical representation of the function. 
They do not lie at random, but suggest a curve which 




TABLES AND GRAPHS IS 

runs smoothly through them in the order indicated from 
left to right. 

The greater the number of entries in the table within 
any given interval AH the more clearly will the correspond- 
ing iDoints suggest a curve : and whatever smooth curve 
is drawn through them it will be an approximate 
representation of one precise curve which par excellence 
represents the function. Suppose that this precise curve is 



whatever pair of corresponding values x, y are taken, the 
point P representing them -will lie on the curve. We shall 
call this the graph of the polynomial f{x), or the curve 
whose equation is y = f(^)> or briefly the curve y = f{x). 

Every polynomial with real coefficients has .such a 
graph. The curve extends indefiiritely to the left and 
to the right. It must cut any straight line drawn parallel 
to the axis of y in one point and one point only, since y 
takes one definite value for each value of x. The curve 
is continuous, as will shortly be proved ; it has no gaps. 
Its curvature is continuous ; it has no sudden corners, 
but bends smoothly. It crosses a straight line drawn 
parallel to the axis of x not more than n times. If w = 1 
it is a straight line. If w = 2 it is a parabola, u])w'^ards 
or downwards, with its axis parallel to Oy. Such 
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a curve, we say, has one bend. If % = 3 it has two bends. 
If 71 = 4 it has one or three such bends. If w = 5 it has 
two or four. In general if n = 2r it has an odd number 
of bends less than n, and if w = 2r+l it has an even 
number of bends less than n. For the purpose of drawing 
polynomial graphs, or of visualizing them, these are useful 
facts to bear in mind. The reasons underlying them 
depend on the properties of the derived polynomials 
f{x), f"{x), and of the fundamental theorem that every 
equation has a root. 

By the first, second, ... derived polynomials we mean 

fix) = %, fix) = and so on. Thus 
ax dx^ 

y=f{x) = 

d^v 

and so on. .Each such derivative is a polynomial of degree 
less by unity than its predecessor. Accordingly the 
derivative is a linear polynomial, while the 
derivative is the constant d^hjjdx'" = n\ which is non- 
zero. All higher derivatives must vanish. 

Example. 

y — fix) — 4a^-[-4, y' — f{x) — iix"—2x—i, 

y" =f"(x) = to-2, y'" =f'{x) = Q,f{x) = 0. 



CHAPTER II 


CONTINUITY AND EVALUATION OF 
POLYNOMIALS 

9. Continuity of a Polynomial Function. Let {x, y) 
be the coordinates of a point P, regarded as fixed, which 
lies on the polynomial graph y =f{x). Let Q be another 
point {x+Ji, y+k) also on the graph, so that y-\-k = fix+Ji). 
We shall say that the function f{x) is continuous at x if we 
can make k as small as we please by taking h small enough. 
More precisely, if e is a given positive number however 
small, then k must lie between — e and e (that is --e<k<€) 
whenever h lies between —rj 
and 7), where 7^ is a positive 
number (usually small) which 
can be ascertained when e is 
given. This statement, a little 
difficult at first sight, turns out 
on reflection to agree with our 
idea of continuity for a curve ; 
but it has the merit of giving 0 x 

in precise quantitative form a 

test for what seems to be a qualitative property. Since a 
value of y exists for each value of x there can bo no gap 
in the curve as traversed from left to right ; but there 
might conceivably be one or more gaps vertically. 
Suppose, in such a case, that P is the last point on the 
curve before such a gap, and Q the first point after the 
gap ; then we have merely to choose e less than the 
distance PQ in order to show that such a cux've does 
not fulfil the quantitative test just given. 

When the test in question is satisfied at cacli value of 
17 B 
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X ( — oo'<aj<oo) we say that the function is continuous 
throughout, and so too is its graph. It will now be proved 
that this is always true of a real polynomial /(*). 

Consider first the monomial function y = x^. Then 
y-\-lc = so that 

Tc ~ (a:+^)”— a?” = + = h,A, 

where n(^^) = n{n—\)j2\, n^r) = n{n—\)...{n—r-{-l)lr\, a 
notation for the binomial coefficient. Each number x, h, 
n, n(r) is finite : the series has n terms and each term has 
a finite number of factors. Consequently A is finite. 
Hence k can be made as small as we please by taking h 
small enough. If k is to be made numerically less than e 
we simply choose h numerically less than e/B, where B is any 
non-zero constant which is numerically greater than the 
finite but variable A. Thus the function a;” is continuous 
throughout. Exactly similar remarks apply to x”, to a 
sum of a finite number of such terms, and so to a poly- 
nomial in general. 

10. The Zeros of f(x), f'(x) and f”{x). The values 
of X for which f{x) vanishes are called the zeros of f(x) ; 
they are the roots of the equation f(x) — 0. At each 
zero of f(z) the value of y is zero ; hence if x is a real root 
the corresponding point {x, y) of the curve lies on the axis 
of X. For example the three points B, E, F of the figure 
in 8 correspond to three real roots of the cubic equation 
there represented, for which a: = —2, 1 or 2. 

From the differential calculus wc know that f'{x) gives 
the gradient of the tangent at {x, y) to the curve y = f{x), 
and that/'(x) vanishes at each point where the tangent is 
parallel to the axis of x. Such points are called turning 
points. At other points on the curve /'(x) is either positive 
or negative. If positive, /'(x)>0, then at such points the 
curve is ascending, as x increases in value from left to right : 
if negative, /'(x)<0, the curve is descending as x increases. 

Again, f'{x) is positive whenever f{x) iucreascs, that 
is, the gradient steepens, which always implies a bend of 
the curve with its concavity upwards ; and/"(x) is negative 
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whenever f'(x) decreases, which happens when the bend 
of the curve has its concavity downwards. Each real 
zero of f''{x) corresponds to a 'point of inflexion on the 
curve, whenever f"'{x)^0. Such points separate the 
upward from the downward bends of the curve ; these 
bends naturally must occur alternately. Usually the 
tangent at an inflexion is not parallel to the axis of x, but 
when it is, the point is both inflexion and turning point. 
The origin in the curve y = a:® is a good example of such a 
point ; and the reader should plot and draw the curve for 
inspection. Other turning points, for which /'(») = 0, 
f''(x)^0 are classified as maxima or minima points on the 
curve : the maxima are at the crests of the waves and the 
minima at the troughs. 

Always reading from left to right (that is, with increasing 
x) we may sum up as follows : 


The curve y=f{z) rises when f'{x)>0, 


faUs 

has a turning point 
a maximum 
a minimum 
is concave downw;ards 
is concave upwards 
has a point of inflexion 


f'{x)<0, 

f'{x) = 0, 

fix) = 0,f{x)<0, 

fix) := 0,rix)>0, 

f'ixXO, 

f"ix)>0, 

/"(r)=0,/'"(^)9^0. 


1. Prove that the curve y = x^—x^ — 4.x + 4: of 8 has a 

maximum between C and D, an inflexion between D and 
E, and a minimum between E and F. (Maximum at 
X = —0-87, inflexion at x = and a minimum at 

cr = 1-54.) 

2. The curve for which ~ + has no 

inflexions. 

3. The curve y = x^+x^-\-x--\-ax~\-b has no inflexions: 

while y = x^ -\-x^ -{-ax has two infioxions. 

4. Examine the case when ])oth/'(:t*) ™ 0,j'"{x) == 0. 


11. Behaviour of a Polynomial at Infinity. It is 
useful to consider the behaviour of the polynomial f{x) 
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for large values of x, positive or negative. For this purpose 
let the function he provisionally written as 

/(ic) = (1) 

where the sign of each term is fixed, either or — , and 
each of a^, is either positive or zero. If a;>0 

each power of x is positive, and the value of j{x) obviously 
is not greater than 

-\-an . . ( 2 ) 

nor less than 

— a„. . . (3) 

Also if x>l, then a:">a:”~i>a;”-2>...>a;>l. Hence 
is less than whenever r — 2, Z, Conse- 

quently, if a:>l, 

>/(») > Ofla;”— (ai+aa+---<^n)‘'«”~^- 


Hence, if A = we have 

{aQ-\-Ajx)x'^ >fix) > {aQ—Ajx)x'^. . . (4) 

Now choose a positive constant e, however small. Since 
A is finite and positive, we can ensure that Ajz<e by 
taking x>Al€ ; that is, by taking x large enough we 
ensure that f{x) lies between the values = (ao-{-e)a;" 
and 2/2 = (aQ—e)z‘^. 

Geometrically, the graph of y must lie between those 
of 2/1 and 2/2 large enough values of x. Let the graph 
of 2/3 be drawn, where 2/3 = u-o*", this being the limit of 
both 2/1 and when e— >0. The graphs 2/1, y^, 2/3 then form 
two very acute curvilinear angles running from the origin 
and bending upwards into the first quadrant if aQ>0, 
and downwards into the fourth quadrant if ao<0. Also 
yi is uppermost, y^ lowest and y^ between. For all largo 
enough values of x the curve y is between the uppermost 
and the lowest of these boundaries — ^the shaded area of 
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the figure. By changing e to or any such proper fraction 
of itself we decrease y^, increase and leave y^ unchanged. 
As e-^0 the graph y approximates more and more closely 
to that of y^, which acts as a curvilinear asymptote to y. 

Analytically we say that for large values of x the 
function f{z) behaves like a(pcA, its leading term. 



Similarly, if x is negative and large, f{x) behaves like 
OqX”'. If n is odd, x negative and positive, then 
is negative, so that the curve is situated in the third 
quadrant. If n is even the curve is situated, for large 
negative values of x, in the second quadrant. 

Example. f{x) = — x^—4x-j-4. 

Here the sum of the absolute values of all coefficients 
except the first gives A — 1+4+4 = 9, so that f(x) lies 
between (l~e)x® and (l+e)^;® whenever z>dje or < —9/e. 

12, The Taylor Expansion of a Polynomial. A 

particular case of an important theorem based on repeated 
differentiation, first employed by James Gregory in 1670 
but first published by Brook Taylor in 1715, is the following 
identity for a polynomial f{x) of degree n : 

fix+h)=f{x)+hf'ix)+hT(^)l2\+...+h»f(^^{x)ln^^^ ( 1 ) 

where (a:) denotes the r** derivative oif{x). 



22 


POLYNOMIALS 


Proof. The polynomial f{x) is the sum of terms of 
type Putting f{x) = a^x'^ in the right-hand side 

of (1) and making use of 

^ x'^ = r{r—l)...(r—k-j-l)x‘^-^, 
dx^ 

we obtain 

{x^+hrx^-^+h\2^x^-^+.. .h^r^x^-”}, 

where denotes, as in 9, p. IS, the binomial coefficient 
r(r— ^+1)/^!- Pu-t fhis is the binomial expansion 
of aj.{x-\-hY. Hence the theorem in question is true for 
each term of the polynomial f{x) ; and so, by addition of 
terms, for the polynomial f{x) itself. 

Examisles. 1. If / («) = ax^+Zbx^ -\-Zcx-\-d evaluate/ (rc+A) 
and /(a:— 6/a). 

2. If /(a;) = a:®— 2a;® + 3a;— 4 evaluate /(a; +2) and f{x—l). 

13. Identities and Equations. We must distinguish 
carefully between an identity and an equation. When 
a polynomial or other function f{x) is thrown into another 
form j>{x) the relation f{x) == f>{x) is called an identity, as 
for example 

2x2-3a;+l = (2a;-l)(a:-l). 

An identity is characterized by the obvious property 
that when it is simplified to its basic terms they all vanish. 
Identities range from arithmetical cases, when no variable 
X appears, to highly elaborate cases involving many 
variables. It is also obvious that an identity is a relation 
which is true for every finite value of such variables. In 
the above instance, whatever finite value x has, the relation 
is true. But a much more important and converse fact 
holds, namely, that if 

f{x) = aQX'>^+aiX'^-'^-\-...+a^ = 0 
is true for every finite value of x then f{x) — 0 is an identity. 
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First Proof. This proceeds by the use of determinants. 
Choose unequal numbers a, k and form the 

alternant 




in a notation where 


m = 


II’ 


ja^^yi = 


|a‘ 

la 


^2 y 2 | 

^ y I 
1 1 1 


and so on. Then /(a) = = 0, and we have 

n-{-l homogeneous linear equations /(a) = 0, /(jS) = 0, ..., 
y(/c) = 0 for the a,,, %, ..., in terms of a, ..., k. By 
the theory of linear equations (Aitken, Determinants and 
Matrices, p. 64) either A = 0 or else all the vanish. 
But A is equal {Ibid., p. 41) to the continued product of 
the \n{n-{-\) differences of a, k taken in pairs, and 

so cannot vanish, since aU of a, j8, ..., k differ. Hence 
all the a^ vanish, so that f{x) = 0 is an identity. 

A corollary follows at once, that if f{x) is a polynomial 
of degree n which vanishes for %+l distinct values of x it 
vanishes identically for every value of x. 

The argument given above is so important that it will 
be worth while to illustrate it for the case of the cubic 
agX^-\~aiX^-\-a2X+as — 0 . 

If a, y, S were four distinct roots then 

cf'Qa^ ”1" -h a^d "h = 0 

+ «2^ + Us = 0 

O'oV® + O’ly^ + + <^-3 ~ 6 

CloS® + OiS® + GSoS -|- 0-3 = 0. 


On eliminating Op, a^, a^, from these equations we have 

] a.® a® a 1 | 

|S 1 

3 2 1 

y y y i 

8 ^ 8 - S 1 


A(a^y 8 ) 


0 . 


But Zl(aj8yS) ~ (a — /8)(a--y)(a — 8)()8™y)(j8--S)(y — S), and so, 
since Zl = 0, two roots at least must bo equal, contrary to 
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hypothesis. It follows that the assumption is wrong, and 
that the cubic equation cannot have more than three distinct 
roots. In the same way an n-ic polynomial equation cannot 
have more than n distinct roots. 

Second Proof. If /(x) = 0 for every value of x, f{z) 
neither increases nor decreases in any range whatever, 
and so /'(x) = 0 for every value of x. But then, by the 
same reasoning, f"{x) — 0 for every value of x, and hence 
again /"'(x) = 0, and likewise all derivatives vanish. But 
since the Taylor series of /(x) with respect to x = 0 is 

f(x) = /(O) +x/'(0) +x2/"(0)/2! +. . . +x»/<«> (0)/n!, 
we see that 

an=m, a^-1 =/'( 0 ), =/"( 0 )/ 2 !, ..., =/<”>( 0 )/»!. 

Hence all the must be zero. 

It follows from this theorem that if /(x) = cf>{x) is an 
identity, true for every value of x, we may equate the 
coefficient of each power x^ on the left to that of x*" on the 
right. In fact we merely write /(x) = 0 and apply 

the theorem. From an algebraic identity involving 
polynomials of the degree we thus obtain n+\ arith- 
metical identities. The identities so obtained may range 
from the most obvious tautologies to highly elaborate 
arithmetical propositions. 

Unless all the aj- vanish /(x) = 0 is an equation of degree 
given by the index of the highest power of x whose co- 
efficient does not vanish. Hence, by the above theorem, 
an equation of the degree cannot have more than 
n roots. 

A polynomial identity can bo described as having dogroe n 
when terms up to the power x" occur in it. If so the identity 
must be taken at its face value — the descriptioii is given 
before simplification is undertaken. Thus 

(x-f l)(x-l)-x2-f- 1 = 0 
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is a quadratic identity. On the other hand, 
{x-\-l){x-l)—x^->r2x = 0 
is not a quadratic but a linear equation. 

14. The Practical Evaluation of a Polynomial. 

It is manifestly of importance to acquire facility in 
calculating the value of f{x) for special values of x. 
How, for example, can we best evaluate /(4) when 
f{x) = 2x^—3x^-\-4:X—5 ? Instead of direct substitution, 
which is often troublesome and liable to errors, the 
following process, introduced in 1819 by Horner, is to be 
recommended, particularly as it furnishes the technique 
required also for the actual numerical solution of the 
equation f{x) — 0. The work is arranged as follows : 

f(x) — 2a;®— 3a;‘^+4a;— 5. 

2 —3 +4 -5 (4 

8 20 96 /(4) = 91. 

6 24 91 

The coefficients 2, —3, 4, —5 with their proper signs 
are placed in, a row according to descending powers of x. 
Missing powers must be mdicated by zero coefficients. 
Thus we have four columns (% + l iii general). The 
argument 4 of the desired /(4) is then entered, as if it were 
the quotient of a long division sum, beyond the final 
column. Beginning at the left the leading entry 2 is now 
multiplied by 4, the product, 8, being written below the 
next entry on the right and added. The result 5 is again 
multiplied by 4 and the product, 20, is added to the 
coefficient in the third column, giving 24. This again is 
multiplied by 4 and the product, 96, is added to the 
coefficient in the fourth, the last column. The process is 
then complete and the result is/(4) = 91. 

The explanation is simple : and it is only necessary 
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to carry out the same procedure with letters to convince 
ourselves of its correctness : — 

a b c d (x 

ax ax^-{~bx ax^ -\-bx^ -\-cx 
ax+h ax^+bx-\-c ax^-\-bx'^+cx+d = f{x). 

According to the rule the last entry in the final column, 
ax^-\-bx^-\-cx+d, should be the value of the polynomial 
whose coefficients in order are a, b, c, d and whose argument 
is X, and this is indeed the case. The method depends on 
the simple identity 

f{x) — {{ax-\-b)x-\-c)x-\-d, 

where brackets are introduced so as to resolve all powers 
of X into single factors. This can obviously be done for 
the general polynomial of degree n. 

The reader will find it instructive to divide 
2x^—3x^-i-4:X~-5 by a:— 4, using long division. The 
quotient is 2x^-\-5x-{-24: and the remainder 91. Horner’s 
method (sometimes called the method of S3mthetic division) 
provides all the materials for the quotient and remainder 
more compactly than the ordinary method : and it has 
the psychological advantage of employing addition rather 
than subtraction as the staple operation. 

Horner’s method yields the quotient and remainder 
at once whenever a polynomial f{x) is divided by a linear 
divisor x~a. For the more general divisor ax+jS, divide 
f{x) first by a and then by x-{-^/a. This yields the correct 
quotient, but the remainder B. is relative to z+jS/a. It 
must be remultiplied by a to give the correct remainder. 

The method does not apply to quadratic and higher 
divisors. 

Examples. Iff(x) = find/( — 3). 

1—20 00 3 (—3 

-3 15 -45 135 —405 

-5 15 -45 135 -402 =/(-3). 

Find a]Ro/(4),/(0-2),/(3), and try other values of x. 
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15. The Graphical Method of Lill. An interest- 
ing graphical method of evaluating /(a), devised by Liil, 
has an advantage when the coefficients of the polynomial 
are awkward decimals, but an approximate result alone 
is required. Squared paper is useful. 

We lay off straight lines AB, BO, CD, DE, ... of lengths 
equal to the respective coefi3.cients cq, a^, a^, .... At 
each point B, C, D, ... there is a right angle corner between 
consecutive lines, to the right if the next coefficient has 
the same sign and to the left if the sign changes. The 
whole track ABODE... may be regarded as a plan of a 
route through a rectangular system of streets. We shall 
suppose that none of the coefficients vanish, so that there 
are %+l segments and n corners to the route. Such a 
graph is clearly a representation of the function, a new 
type but none the less a representation. 

Now take an acute angle 6, such that tan 6 = x: 
then one such angle corresponds to a given real x, positive 
or negative, and the sign of 6 is the same as that of x. 
Draw a line AP meeting BC (produced if necessary) at 

P, and such that the angle BAP is d. If AB is set 
off horizontally from left to right, then P falls below 
AB when x is negative and above v/hen x is positive. 
Draw a broken line APQR... such that the angles P, 

Q, ... are right angles situated on the lines BC, CD, DE, ... 
respectively. Then the value of f{x) is given by the 
distance RE, from the last such point R to E, the; end of 
the final segment This distance is measured according 
to the sense of the distance DE : thus in the first illustration 
DE is +4, the direction of measurement is from D to E, 
hence RE is positive. In the second illustration DE is 
—3, therefore RE, having the same sign as DE, is negative. 

To prove the result we work out the segments step by 
step. Thus in the first figure AB = BP = tan 6. 
PC = ai-fUo ^ fail hence 

QD = CD — CQ = Ugfi” (ffi 
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D Q, C 

a:S+2£t’2 + 3x+4. 

No change of sign : all comers to the right. 
{0 = —45°, X = — 1). Compare Ex. 8, p. 33. 


C Cl D 



V — 2.X’2 — Sk — 3. 

One change of sign : first corner to the left. 
(^CO, tan 0 = ,'i; '= — J). 



GRAPHICAL METHOD 


J'inally, RE = DE — DR = DE — DQ taji ft 
= ttg+QD X " 


The process can be applied 
to aU such cases and for all values 
of n. 

Evidently the equation is 
solved whenever the point R 
coincides vdth E, for in this case 
RE vanishes. It requires but 
little practice to see that this 
would happen, in our first illus- 
trative example, when 6 is rather 
more than 45°. This furnishes 
an interesting and ingenious 
method for locating the roots of 
an equation. 



Examples. 1. Adapt the rule 
to the case where one or more of | 
the intermediate coefficients are E 
lacking. x^+2x—5.' 

Proceed as before but make the 
necessary points coincide. For x^+2x — 5 we have AB = . 
CD = 2, DE = 5. Whether BC is^regarded as -fO or 



A 
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the rule requires CD to be in BA produced, a right-about or 
a lefb-about turn. 

2. Justify the preceding figures for x^+2x—5 and for 
x^-\-2x^ — 5, where /(a;) is represented by RE with a; = 1. 

3. Show that x^+2x—5 — 0 has a root which is slightly 
greater than unity. (Move P slightly upwards to cause R to 
coincide with E.) 

4. Draw the zigzag graphs ABC... for a:®— 4, a;®-|-8, 
x^+x^+2x^+2x + l. 

5. Show from the zigzag that 2a;*— 2a;® + 4a:®— 4a: -4-1 = 0 
has a root between 0 and 1. 

16. Horner's Method of Reducing an Algebraic 
Equation. For the purpose of solving a numerical 
equation by successive approximations — a method which 
is exemplified in ordinary arithmetic when we calculate 
y'2 as 1-4142..., digit by digit — we need a practical way 
of expressing /(a:) as a polynomial g{y) where y = x-\-a. 

Th}^s f{x) = -{-an-iX+an . . (1) 

and 9'(y) = • ■ (2) 

are to be identically equal when y = x—a. The co- 
efificients are supposed known ; so too is a. Wo require 
to find the coefficients 

This can of course be done by Taylor’s theorem ; in 
fact 

9'(2/) =/(*) =/(2/+a) =/(a)-f?//'(a)-4-...+y"/<”>(a)/»!, (3) 

which exhibits /(a:) as a polynomial in y of degree n. Hence 
bn — /(«■)) ^w-i = /'(a) and so on. Incidentally this shows 
that the degree n is the same both for g{y) in y and f{x) 
in X. Theoretically, therefore, the question is solved, but 
we stiU need a convenient practical way of calculating the 
coefficients b^. This is done by Horner’s method. 

Suppose, for example, that we wish to express 

f{x) -= 2a;*-3a:®-i-4a;2-5a;-f 0 

in the form 

bfj^x 2 )*- 4 - 6 j(a: — 2)^-\-b^{x — 2i)^-{-b^{x — 2 ) 4 - 64 . 
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We apply Horner’s metliod, dividing f{x) by x~2. From 
tbe Horner scheme 

2 —3 4 —5 6 (2 

4 2 ^ 14 

r 6 7 20 

we infer that the quotient is 2a:®+a;®+6a:+7 and that the 
remainder is 20. In fact 

2 a:*— 5a;+6 = 

The reader will find it instructive to reverse the process 
and start by multiplying out this identity. The above 
scheme will then be seen from another point of view. 

Now let 2x^-l-x^-j-6x-{-7 be treated similarly and divided 
by £c— 2. From the scheme 

2 1 6 7 (2 

4 W ^ 

6 16 39 

we infer that 2a;®+a:^+6a:+7 = (2x^-j-5x-j-16)(x—2)-i-39. 
Again, from 

2 5 16(2 " 

4 m 
"9 34 

we infer that 2a:2+5a:+16 = (2a;4-9)(a:— 2)-b34, 
and finally from 

2 9(2 

4 

13 

that 2a;-f-9 = 2(a:— 2)+13. 

We can telescope these results and write 
2x*-3a:3+4a:2-5a;+6 = (((2y+13)2^+34)?/+39)?/+20 

= 2y^-\~\3y^ -\-39y -\~20, 
where y — x—2, and this is the required form. In fact 
= 2, = 13, 62 = 34, 63 = 39, 64 = 20. 



32 


POLYNOMIALS 


We notice that the successive remainders have furnished 
the coefficients b from right to left, except for which is 
the same as Clearly too the process is true in general : 
the hi are the remainders on dividing by y (= a:— a) first 
/(a;) and then its successive quotients. 

Furthermore — and this is the chief advantage of the 
method — we can tabulate the whole process in one 


scheme : 



2 ~3 

4 

-6 6 (2 

4 

2 

12 14 

1 

6 

7 20 

4 

10 

32 

5 

16 

'39 

4 

18 

f(x) = 2a;*— 3x®+4a:2— S.T+fi 

9 

134 

= 2^*+13yH342/2+392^+20, 

4 


where y — x—2. 

13 



It wiU be 

seen 

that this scheme incorporates all that has 


been said, but without needless repetition. The method 
is easy to memorize directly from such a scheme. Build 
the scheme from left to right by rows. At each step 
multiply the balance in a column by a{a = 2 above) and 
add it to the balance in the next column to the right. 
Perform the step once in the last column, twice in the last 
but one, and so on. Rule off with a vertical line at the 
completion of each stage. 


Examples. 

1 4 

£ 

7 

_3 

10 

3 


1 . 

0 

21 

30 

m 


-70 (3 
£3 

f(x) = x^+4x^~70 
^,f + 13y^+51y 
where y ~ x—Z. 


-7, 


13 
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2. Prove by Horner’s method that 

== (a; + l)4^4(a; + l)3 + 6(a7 + l)2--.4(ic + l). 

3. Express /(rr) as a polynomial in y when 

f(x) = — and y = x — Z ; also when y = x + Z, 

4. From Example 1 ohtojm f'{Z), f''{Z),f'''{Z). (51, 26, 6), 

5. Evaluate (2x^ — 3x^+5x^ — 6x + 7) ~ (x — 2), -r (x-i-2), 
and again (ia? + 3). 

6. Obtain a, h, c, d, e,/ from the identity 

x^-\-x^ + l = (a; — 2)® +a(a7— 2)®+6(a;-— 2)^ 

+c(a? — 2Y+d(x — 2)^-\-e{x — 2) +/• 

7 . If x^ — 4:X^-{~Zx — l = 

(x~~l)(x-‘2)(x-Z)+a(x-^l)(x — 2)+b{x-^l)+c, 
find a, b, c. 

-4 8 ->1 (1, 2, 3 

1 --3 5 

-~3 5 4 = c 

2 ^ 

^ ~3 = 6 

3 

2 = a 


x^ — 2x^+Zx—4c. 
Three changes of sign : all 
corners to the left. 

(d = 45°, X = 1). 

Draw the track when x == 

9, Heduce £4, 2s, 8d. to pence by Horner’s Method. 

4 2 8 

82 992 

Thus Horner’s Method is simply the systematic use of a very 
familiar process in elementary arithmetic. 
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CHAPTER III 


THE THEORY OF RATIONAL FUNCTIONS 

17. The Division of Polynomials. The arithmetical 
process of division plays two distinct r61es : it is either 
a sharing, or else a measuring, process. To divide 28 inches 
into 7 equal parts is a sharing process, to find out how 
many times a length 7 inches is contained in 28 inches 
is a measuring process, but the same arithmetical symbol 
28-^7 is used for both. When the exact multiple 28 of 7 
is replaced by a number such as 23, which is not a multiple 
of 7, the distinction is more evident : sharing leads to a 
proper fraction 2/7 as remainder, while measuring leads to 
a whole number remainder 2. The quotient 3 is the 
same in either case. 

23-7; 23/7 = 3+2/7, 23 = 3x7+2. 

Measuring division may be regarded as repeated subtrac- 
tion, proceeding just so far as to produce a zero, or else 
a final remainder which is less than the divisor. 
Subtracting 7 step by step, we have 23, 16, 9, 2. The 
number of terms which are not less than the divisor gives 
the quotient, while the term, if any, which is less than 
the divisor is the remainder. 

If D — dividend, d = divisor, g = quotient, 
r = remainder, then we can state these facts in the form 

D-rd ; Djd = q-i-r/d, D — qd-{-r. 

When D and d are positive integers, so also are q and r, 
with the important condition that Q^r<d. 

In algebra these statements are closely paralleled by 
the properties of polynomial division. A whole number 
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such as 384 is really a quadratic polynomial in respect 
to ten : 384 = 3£c^+8r+4, when x = 10. A decimal 

terminating fraction such as 384*25 is really a rational 
function of ten, expressed as partial fractions, 

2 6 

384*25 = 3x^-{-8x-\-4:-i 1 r when x = 10. 

X x^ 

A non-terminating decimal is an infinite series in descend- 
ing powers of ten. Thus the polynomial /(x) in algebra 
plays the part of the whole number in arithmetic, while 
the rational function 

R{x) = fix)l(l>{x) 

composed of two polynomials, whose degrees are m and n 
respectively, plays the part of the arithmetical fraction. 

Let j{x) = a^i=^0, 

(f){x) = CqX^ Co=?^0, 

where aU the coefficients and are constants. If 
m'^n the function B{x) is called an improper, and if 
man a proper, fraction with regard to x. If m'^n we 
may divide /(«) by ^{x) and obtain 

f{x)l4>{x) = q{x)+r{x)l<f>{x), f{x) = q{x)(f>{x)+r{x), 

where q{x) is a polynomial of degree m — n, and r{x) is one 
of degree less than n. For example 

x*-^2x^ — a;+l ^ 2x^—5 

x^ — 2a;+3 x^ — 2a;-t-3’ 

x*-\-2x^—x+l — {x+2){x^—2x+Z)-{-{2x^—5). 

Let us note the following properties : (i) i2(a;) reduces 
to a polynomial when n is zero, or else when <l>(x) is a factor 
of/(a;), in which case r(x) vanishes identically. 

(ii) The highest term in q{x) is a^Cg-^x'^-'^, as is seen 
by actual division, so that the degree of q(x) is m—n. 

(iii) The relation f{x) = q{x)(f>(x) +r{x) is identically 
true for every value of x. 
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This division process is unique ; for if not, let 
and be another quotient and remainder of /-r^, so that 

f{x) ==" q 4 >-\-r = 

where, for brevity, q is written instead of q{x), and so on. 
Then {q—qi)<j> = r—r^ identically. If q^q^ the left-hand 
expression is a polynomial expression of degree at least 
that of (^, whereas the degree of both r and r-^ and therefore 
of their difference is less than that of (f>. This (c/. p. 24) 
is impossible : hence q = g,x,&o that r = and the division 
is unique. 

By Euclid’s method of repeated measuring, commonly 
called the G.C.M. or the H.C.E. process, we can discover 
whether f{x) and ^{x) possess a polynomial factor in 
common. Again, for brevity, let single letters denote 
polynomials. Divide f hy <j> giving remainder r : divide 
^ by r giving a remainder : divide r by giving a 
remainder and so on. Thus 

<l> = qir-i-ri, 

r = q2rx+r^, 


^ 35—2 

Each of /, <f), r, r^, ... is a polynomial, where the degree 
of r is less than that of <f>, that of is less than that of r, 
and so on. Consequently the process must terminate since 
the degree of ^ is finite. Either a remainder vanishes 
identically, or the degree of the last r,, is zero, that is, 
Tj, is a constant c^Q. 

Case 1. If rj, = 0 and all its predecessors are non-zero, 
then is a factor of r^^^. But r^^^ ^ 
so that is also a factor of r^_3, and so oii tliroughout 
all the equations. Thus r^.^ is a factor common to /, (f> 
and to each r. 

Conversely, any factor s common to / and 9S is a factor 
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of f—qtj) and therefore of r, similarly of that is 

of 7\, and so on. Hence includes every common factor 
of / and <f) : it must therefore be the highest common factor 
(H.C.F.) or greatest common measure (G.O.M.) of/ and 

Case 2. If the last remainder is c, a non-zero constant, 
it must include as before every common factor of / and (f>. 
Since c is free from x we say that, in this case, / and <f> are 
poljmomials which are prime to each other. They have 
no common factor. 

Hence anj’’ two polynomials are either prime to each 
other or else have a common polynomial factor. The 
above Euclidean process is unique and rational ; and it 
will inevitably detect any factor common to / and 
We denote the G.C.M. by G, and m Case 2 we take G = 1. 

Theorem 1. Each partial remainder r, including G, 
can be expressed iu the form where A and B are 

polynomials of degree less than <j> and / respectively. 

Proof. Solving the relation for r, r^, r^, ... in succession, 
we obtain 

r = f—qcf> = Aof-{-Bo4>, say, 

^"2 = (9'l2'2 + l)/~tol9'2+9'+9'2)9^ = ^2f+-®2'5^j 
and so on. In this way the row is obtained on multi- 
plying the preceding row by — and adding the last but 
one row. Hence each partial remainder is expressed in the 
form Af-\-B^, where A and B are polynomials. 

Again, by the mode of formation, A^^ the coefficient 
of / contains as highest term drQ'i3'a-"9'fcj while that of 
^Tc is i 9 ' 2 'i 9 ' 2 "- 9 'jfc- How from (1) the degree of / is that 
of g^, the degree of is that of q^r ; indeed the continued 
products 

have the same degree. Remove the common factors : 
therefore / and bave the same degree. But 

fv-i) being a remainder before the last, must involve x 
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and have a positive degree, so that the degree of 
is less than that of /. 

Once more, on applying the same argument to all 
hut the first equation of (1), the degree of g'i§' 2 - • •g'® is less 
than that of <^. 

Accordingly the degree of every A is less than that 
of <j}, and of every B is less than that of /. 

Corollary. Polynomials A and B exist for any pair of 
polynomials / and <f>, such that either Af-\-B(^ = Q ox 
Af-\-B4> = 1} where 0 is the G.C.M. of / and or else 
/ and ^ are prime to each other. 

The corollary is proved by applying the above theorem 
to the final non-zero remainder, v/hich is either or a 
constant c. In the latter case divide Af-\-Bcf) = c through- 
out by c and rename the polynomial coefficients of f and 
(f), as A and B. 

This theorem, with its corollary, is of fundamental 
importance in many branches of algebra. 

Examples. 1./ = »*+!> <f> — Here the successive 

division processes give 

a;® -pi = ] ) +(— where q = x, r — — aj+l, 

+ l == (— a3-l-l)(— a; — 1) -1-2, q^ — —x — \, == 2. 

Hence / and ^ are prime to each other. Also 

+ l = [a:=‘-|-l-a;(.^'2-f l)](-a:-l)-f2. 

Thus (a;-l-l)(a;®-l-l) a;— a;2)(a;--f- 1) = 2, so that 
dL( = i'(x-l-l)) is of degree less than that of (j), and 
B{ = |■(1— a;— .T^)) is of degree less than that of/. 

2. / = = x^ — 1. Here a;®-l-l = a;(a:® — l)-l-a3-}-l, 

x^ — 1 — (a: — l)(.'c-l-l). Thus G — a;-|-l, and (a;®-l-l)— .a’(a3“ — 1) 
== a;-fl shows that A = 1, B — ~x, whore A.f+B(l> — G. 

18. Reduction to Lowest Terms. A proper fraction 
p{x)jq{x) cannot be identically equal to a polynomial /(a;) ; 
otherwise p{x) = f{x)q{x) would bo an identity, where at 
least one term on the left is of higlier degree than any on 
the right and therefore (c/. p. 24) could not cancel out. 
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The combination, in sum or difference, of a finite 
number of algebraic proper fractions always gives a proper 
fraction : for 

q's ^ 

and the degrees of both ps and qr are less than the degree 
of qs when pjq and rjs are proper fractions. For n such 
terms, add them one by one, applying this argument 
each time. 

In individual arithmetical cases this property is not 
necessarily true. 


Example. 


aj® X 

a:3~+i’*"a;2 + l 


gives a single fraction whose 


degree is four in the numerator and five in the denominator. 
Put X = 1 •, the sum is not a proper fraction. 


Theorem 2. Iff is prime to ^ thenf/<f) cannot be reduced 
to terms of lower degree. 

Proof. If possible let fjcf) = fil4‘i, v/here /j is prime 
to Divide / by 9 ^, and by : let the respective 
remainders be r, Then 

q<f>-\-r _ qifi+r^ 


or 





where q~qi is a polynomial and the right-hand expression is 
a proper fraction. Hence both vanish, so that r/(j 6 = 
Hence 


fifi = = r/ri = ... . 


This process has reduced to a fi'action rlr^^ of lower 
degree in both numerator and denominator. Let it 
therefore be repeated until, say, is reached, where 

either or rj.+i is a constant which may be taken as unity. 
If is a constant and Tj. is not, then / = fp-j^, (f> = 
shows / and <f> both factorized, whereas by hypothesis f 
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was prime to (jy. Similarly, rj. cannot be a constant while 
is not ; so that both are constant, and so too is 
= K say. Thus / = A/^, = A^i and fl4> can 

differ from fil<^x by a constant factor only in numerator 
and denominator. Thus fjcfy cannot be reduced to terms 
of lower degree. 

Theorem 3. The form = (? or 1 is unique, 

where A and B are lower in degree than and fjG 
respectively. 

Proof. Iff and cf> have a common factor, write / = Gf^, 
(j) = G<f}j, and treat the prime case Afx-{-B<f)x = 1 . 

Assume now that / is prime to (/>, and that a second 
such identity Cf-{-Df> = 1 exists. By division express G 
as Qf>A-Ag and D as Pf-i-B^, where Aq is of degree less 
than that of (f> and Bq less than that of /. Hence 

iAo-{-Q<f>)f-\-{Bo-\-Pf]f> =1. . . (2) 

This is an identity, where f(f> is of degree m-\-n, A^f is of 
lower degree, and so is B^cf). Unless P+Q = 0, the identity 
contains terms {P-\-Q)f^ of degree at least m+% which 
cannot vanish. This is impossible (c/. p. 24). Hence 
P+Q = 0 and Agf+B^fi — 1 . 

Once more, Aq and Bq must be the A and B of the 
Theorem 1 ; for if not we have 

so that {Afx~A)f — (B—Bf}(^. Unless A = A^, B — Bq 
this reduces the fraction //^ to lower terms {B —Bq) /{Aq —A), 
which is impossible when / is prime to c/y : thxis A — Aq, 
B — Bq and the formula Af-{-B<j> — 1 is unique. 

19 . Partial Fractions. Let f>{x) be capable of 
breaking up into two polynomial factors 9 & 1 , <^2 which 
are prime to each other. Then, by the Theorem 1 , we 
can find polynomials (or constants) A^ and A 2 , of 
lower degrees than and <j>^ respectively, such that 
This at once enables us to resolve 
f{x)j<f>{x) into partial fractions ; for 

/(^) /(g ) {A 2^)1 Ax<f>f) -^if A2f 

4'i 


( 1 ) 
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Oa performing the division, if improper fractions occur 
in either of the pair, we obtain 


m 


= q{x) + 


!iM 


rJx) 


( 2 ) 


where q{x) is the total quotient, and. r^, are the respective 
remainders, for the divisors and 


Example. 


^ ^ 33—1 1 

= 2 a ;— 2 1 - . 

(a:^ -1-1) (33 + 1) cc^-l-l 33-|-l 


The reader will be famihar with simple examples of the 
method of partial fractions. Certain observations can 
therefore be shortly stated : (i) any common factors of 
/ and ^ are first removed before resolution into partial 
fractions is attempted : (ii) the quotient q{x) is of degree 
m — n and only appears if m, the degree of/, is not less 
than n, that of ^ : (iii) and iJiust have no common 
factor, although each can be of any degree : (iv) if <f}^ 
is of degree p the numerator r^^ix) is a polynomial of degree 
less than p. 

If either cfji or <^2 ^^.s polynomial factors the process 
may be repeated, until all the distinct factors of ^ are 
segregated. Thus we express the rational function It{x) 
as a sum of partial fractions in the form 


B(x) = 


M 

^{x) 


= q{x) -f- S 


r{x) 


(3) 


where no two of the denominators ?/r(a:) have any factor 
in common, and each r{x) is of lower degree than its 
denominator. 

Various cases arise : 

(i) The simple case, when <f>{x) — {z—a){x—^)...{x—X) 
is a product of n distinct linear factors in which a, ..., A 
all differ. We write ^( 33 ) ■*=i7(x— a). 

(ii) The repeated factor case, when 

(l){x) ~ (a;— a) ’■(03-/3)®... , 
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where a, ... all differ but one or more of r, s, ... exceeds 
unity. We write ^{z) = IJ{z—ay. 

(iii) The simple quadratic case, when (f>{x) has quadratic 
factors z^-\-px-{-q, with or without linear factors, and 
p^<,4:q. 

(iv) The repeated quadratic case. 

In any case the polynomial quotient q{z) must be 
evaluated by an ordinary division process. In the simple 
case, (i), each partial fraction will take the form 
ajix—a), where the denominator is linear and the 
numerator, being of lower degree, is necessarily a constant 
only. Hence we have an identity 


fj^ 

<f>ix) 


= • 

X — a 


To evaluate a, multiply throughout by x—a. Every 
term on the right will then contain x—a as a factor except 
the term a itself. Put x = a and the right-hand member 
is a alone. On the left we shall have 


/(a)/{(a— (a— y) . . . (a— A)}, 


which must therefore be equal to a. Similarly for each 
numerator of the partial fractions. 


Examples. 1. , = z+S+ -) — ^ 

X^ — SX + 2 ;t,: — 1 x — 2 

Here x^—3x-\-2 = {x — l)(a: — 2) and q(x) = .t+S by division. 
Multiply through by .t — 1 and then put a: — 1. Thus 

1 8 

= a. Similarly, - — - = b, so that a — — 1, 6 = 8. 

i — At — J. 

2. If ^(x) = {x—(x){x~^){x—y) whoro a, jS, y differ, prove 
that 

+. + * 


3 . ■«“> 

j,{x) ^ {x-A)4'{X) 
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where (l>{x) = {x—a){x — ^)..,{x^X), lff{x) is of lower degree 
than <l>(x), the quotient q{x) vanishes. 

To prove this, observe that ^'(x) = (x — ^){x~~y)..,{x—X) 
plus terms all involving x — a as a factor. Hence (l>'{a) 
= (a-“jS)(a— A). Hence the value of a in the partial 
fraction aj(x — a) is/(a)/^'(a), and the result follows. 

4. JIJ{x) is a cubic polynomial and a, y, S are any four 
distinct numbers, show that 


f{x) = Sf{a) 


ix--^){x~y){x-8) 
(a — j8)(a — y)(a~-S) 


summed for the four terms due to the distinct combinations 
jgyS, ySa, SajS, a^y. 


This follows from Example 3. It is Lagrange’s inter- 
polation formula for a cubic polynomial, and it applies in 
general, with the necessary changes, to the ?^-ic. 


Case (ii). 
gives 


If <p{x) = {X’--ay{x—^Y ... the general rule 


f(x) 

cl){x) 


= qix)-\-2 


A 

(x—ay 


where ^ is a polynomial of degree less than r. 

Put X — a = y and express A by Horner’s method as 
a polynomial PxA'P^yA-Pz'y^A'--- degree less than'r. 
The partial fraction due to x—a. is then 
which separates at once into the terms 


53l _ I Pz 
(x—aY (cc— a)’"’- 


Pr 

X — a’ 


where the numerators are constants. At most there are 
n such partial fractions ; in particular cases some, after 
the first, may vanish. Similarly, here are at most s terms 
due to B, and so on. 

In practice the calculation of the numerators is trouble- 
some whenever r>2 ; but p^ is obtained at once on 
multiplying throughout by (x—aY and then putting 
X = a. If r — 2, multiply throughout by {x—aY, 
differentiate with respect to x and then put x = a. This 



44 


RATIONAL FUNCTIONS 


gives ^2- higher cases multiplication and further 
differentiation peld the successive numerators of the 
partial fractions in question, but probably Horner’s 
method is more rapid than this. In some cases equating 
of coefficients gives the results quite rapidly. 


Examples. 1. 

a:® p g r s 

(£t;+2)®(a;®— 1) {x+2)^'^x+2 + 

Hence ■ ^ = p+{x-\-2)q+{x-]r2)\...'\. 

— i 


First put X = —2 ; then —8/3 == p. Next differentiate : 
the third term on the right then becomes 

2(a;+2)[...]+(rr + 2)2[...r, 


where the brackets indicate fractions with in the 

denominators to first or second powers. On putting cr = — 2 
these denominators do not vanish. Hence the third term 
(rr+2)^[...] vanishes after differentiation without the need 
for calculating explicitly the expression in the bracket. Thus 


. 2x 


g+(a!+2){...}, 


giving q = 4/9, Also r = 1/18, 5 = f , 

Alternatively, we might form relations between the un- 
known coefficients by substituting other special values of x. 
Here cr = 0 gives = 0, so that when the values 

of p, T and s are found as before we obtain ^ = 4/9. 

^ ^11 
■ (a:+2)%r®-l) ■“ (a:+2)« 162(a:-l) "''2(a; + l)’ 

where A is a cubic in x and the other fractions liave been 
evaluated by the ordinary method. Put x+-2 
A = multiply throughout by the 

denominator involving x. Thus 

(2/ -2)8 =:[pi +pa2/ +P32/" +P4,y^]{y^-^y +3) +2/^ 


'// — 1 y 3' 

-—4 _ . 
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Now pick out coefficients of y. We find^j^ = — 8/3, — 4/9, 
Pa = —14/27, = —41/81 most easily from the coefficients 


of 2 / 0 , 2 /^ 


2/^, 2/®> and can use 2/®, y® for checking the result. 


ax-\-h 

Case (iii). Quadratic partial fractions -;r- ; — are 

retained if we wish, to avoid complex numbers in the 
case when p^<4g'. Here there are two real undetermined 
coefficients in the numerator. Paradoxically the values 
of a and h are readily found by resorting to complex roots 
and proceeding as in (i). Multiply throughout by x—a—i^ 
where a+ijS is a complex root of = 0 and then 

put X = a+ijS. 

a:* A ax-\-h 

xamp e. ^x-\-\Y{x‘^-\-4x-\-5) (ai + l)® »®+4cc + 5* 


Here .4 is a quadratic in x. The complex factors of a;^+4a;+6 
are (a;+2+i)(a3+2— i). Multiply by x+2+i and put 
X = —2—i. Then 

(— 2— i)® a(— 2— ■i)+& 


This reduces, since = —1, to 9/4 — 13*/4 = b — 2a — ai. 

Since a and b are real, we have b—2a = 9/4, a = 13/4, 

13 9 7 

and so 6 = 35/4. Also A = — x^ — - x — -. 

4 2 4 

Case (iv). The real partial fraction r — may 

^ ^ {x^+px+qY 

occur, where p 4g. Since the denominator is of degree 2r, 
that of A may be 2r — 1 or less. On dividing A repeatedly 

by X = x^-\-px-\-<l, let the remainders be R^, Then 

the fraction can be written 



R 


r-1 




+...+ 


% 

X 


Such remainders R are linear and real. Hence the original 
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rational function lias been reduced to a sum of terms of 
the following possible types : 

1^0' o, ax-{-b ax-^b 

x—a’ (x—a)’’’ x'‘‘-\-'px-\-q' {x'‘‘-\-<px-\-<^‘^' 

If the coefficients in the original function f(x)lcl>{x) are 
complex, cases (iii) and (iv) are unimportant. They are 
significant when we wish to express a real function in its 
simplest partial fractions. 

The following examples suggest further aspects of the 
subject. 


2x _ 1 ■ 

x^+4: x+2i 

2 

■ (x-l){x^-3) 


1 

x—2i * 


8x 2 2 

4x^ — 3 2x-^VS 2x—V3 


ax-{-b 

x^-^3 




find the constants a, 6, c. 


This is the form to use if we wish to avoid irrational real 
numbers. Multiply by — 1 and put x = I ; then c — — 2. 
Multiply by 07 — V 3 and then put a? = ^3. Hence 

5 — V 3 aVS+ft _ _ _ 

(Vi-Ti5V3 “ (V3-I)(aV3+!.). 

or 5-\-b—Za = \/3{l-\-b~-a). From x-\-^/3 we get the 
same result but with — VS for +V3. Hence each side of 
this equation vanishes separately and we have a — 2, 6 = 1. 

3. If /(a;) is of lower degree than ^(jc) — {x—a){x—^){x—Y), 
where a, y are distinct, prove that 

/(«) M) fM 

/(^) x — a X — l X — y ^ 

<f>(x) ay 

1 1 


4. If a = ^zjLy, prove the result corresponding to the 
above, where the second column of each determinant is 
replaced by the derivative of the first column with respect 

f 3 f( ') 

to a; that is, these columns become -I — , 1, oi and 

\da x — a J 


{2a, 1, 0} respectively. 
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20. Determinantal Form of Partial Fractions. 

If f{x) is a polynomial of degree less than that of 
which is supposed to have no repeated factors, we can 
resolve /(a:) (a:) into the form 


cj>{x) 


1 1 ... 1 

a ^ ... A 

a® ...A2 


... Xn-2 

/(oO /(^) —/(A) 

-a X — B ••• * — 


1 1 ... 1 

a B ... A 

a2 ... A2 


^n-2 __ \n-2 

^n-1 ^n-1 __ 


For the denominator determinant, A say, is the n- 
rowed alternant (Aitken, Determinants and Matrices, p. 41) 
which has the ^n{n — 1) linear factors such as jS— a. In fact 

A = A{aB...K\) = {B-a){y~a)...{X-a) 


X (A — k). 

Also the numerator determinant may be expanded 
according to its final row as 

^ (3) 

x—a 

the summation being of n terms corresponding to a, /3, 
..., A, where A is an alternant of w — 1 letters. Hence the 
right-hand side of (1) possesses the typical term 

A{aB---KX){x—X) (A~a)(A— jS)...(A— /c)(a:— A) 

whidh. is precisely the term given by the last partial 
fraction oi f{x)lcf){x). Symmetry shows that all the partial 
fractions are obtained in this way. 
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21. The Confluent Case of Partial Fractions. Let 

^{x) = {x—aY{x—^Y..., . . ( 1 ) 

where a, j8, ... are all distinct. We call this the confluent 
case, where r of the roots a may be regarded as originally 
distinct but tending to equality, s further roots ^ tending 
to equality, and so on. If in the above determinants 
two roots are made equal the quotient is indeterminate, 
but if in the second column of each determinant a is 
replaced by a+Ji and h is allowed to tend to zero, the 
quotient of the determinants may be evaluated. When 
the first r roots are confluent and equal to a we take 

8u 1 d^u d‘^~^u 

2! (r-1)! 0a^-i 


to be the first r elements of any row, in numerator and 
denominator determinants, where u denotes the original 
entry in the first column on the row in question ; that is, 
we differentiate each column successively and divide by 
a suitable factorial as in Taylor’s expansion. The next 
s columns are formed in the same way afresh from a new 
root jS ; and so on. For example, if f{x) is a cubic or 
lower polynomial, 

Si^) 




5^. 

CO 

1 

-iS) = 

- 

1 

, 

, 

1 

1 

a 

1 

. 



a® 

2a 

1 

i8^ 

— !L 

/(a) 

(M.] 

/ 1 //(a) Y' 

m) 


x — a 

— aj 

1 2!\.T-a; 

x-p 



1 . .1 

a 1 . p 

2a 1 
a® 3a^ 3a 


(3) 


where accents denote differentiation with respect to a. 

Proof. Proceed by induction on n, the degree of the 
denominator <f>{x), and arrange the cases for the same 
value of n in the lexicograpliical order of the partitions 
{r5...i}, where y+5+...+i = ?i. The ordinary case is 
{11... 1}, and all other cases are confluent. For « = 4 
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the order is therefore {1111}, {211}, {22}, {31}, {4}. Any 
confluent case {...p} or {...■pi...}, where p is the last index 
exceeding u nity , is derivable from an earlier case {...(^—1)1} 
or {...(^—1)11...} by letting the index p—\ merge with 
the next sueceeding unit index. 

Now assume the truth of the theorem for this earlier 
case, letting be the factors in <f>{x) answering 

to the indices p—1, 1 which become confluent when 
S = y-\-h and h-^0. The final entry of the column 
answering to 8 in the upper determinant will then be . 


/(y) 

X—y 


/(S) _ f{y+h) 
x~S x-r-y — h 






(p-l) 

+... 


Here every term before the p^^ in the series will disappear, 
by subtracting a suitable multiple of one of the 1 
columns just preceding — for similar series on each row of 
either determinant. So these may be discarded. After 
cancelling the factor from both determinants and 

then letting the required result follows. 


Example. Replace the third columns in (3) above by 
{1, y, y®, f{y)l{x—y)} and {1, y, y®, y®}, and the left-hand 
expression by f{x)j(x — a)^{x — ^){x — y). Assuming the truth 
of this identity, let y = a -f 7i. The third columns are now 


1 


ot ~\~h 
(a-f/l)® 


, 8u /i® 
u+h— + - 


d^u 




and 


1 


a “j“7i 

ia+h)^ 

(a+h)\ 


where u =/(a)/(a; — a). The operation C 0 I 3 — coli— /icol^, 
followed by cancelling and then putting h = 0, yields at 
once the result desired, corresponding to Case (ii) of 19, p. 41. 

Corollary. We append a proof, by a similar induction, 
of the theorem on the value of the confluent alternant : 

... A*) = i7(/3 — a)^ 

D 
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where the index rs is the product of the corresponding 
indices in J. 

Proof. It is understood that = n, that 

a, A are distinct, and that the blocks of confluent 

columns are arranged as in the denominator determinant 
above. Let p(>l) be the final confluent index. 

Assuming the truth of the theorem for the partition 
{...(^—1)1...} we have, say, 

= ( 8 — 

where the product index p — I is obtained from the indices 
of y and 3 in J, and ^ denotes all remaining factors. Let 
3 = y-j-h, so that the right-hand expression becomes 

hP~^{y — -\-h — a) 

Divide by }iP~^ as before and let >0. The left becomes 
the required new confluent determinant, and the right 
is a product of factors such as 

{y—ayp-r+r = {y—af^ 

which involve y with a or any other root distinct from y, 
together with factors such as (j8— a)’’® which are unchanged 
throughout the process. 

Examples. 1. In the denominator of (3) above, the 
value is 

— a)® “ d(aaaj8) = d(a®^). 

On the other hand, A(a^Py} — (■)/~-a)"(^ — a)“(y— j8). 

2. A(a*^Y) = ()3-a)i*{y-a)8(y-^)«. 

3. Integrate with respect to x : f{x)l{x—a)(x~p)(x~y), 
f(x) j{x — a.y{x — ^),f{x) jix — a)*. (Replace f{a)l{x — a) th rough- 
out by/{a)log(a:— a) in the doterminantal fonnulte equivalent 
to the sum of partial fractions.) 

22. The Expansion of a Rational Function. Partial 
fractions give a means of expanding a function of x in 
ascending or descending powers of x. We shall illustrate 
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this for the simple case when all the factors of <f){x) are 
distinct. Let 

^(a:) x—a x—^ A 

Now al{x — a) = a{x~^-\-ax~'^-\-a^xr^-\-a'^x~^-\-...), (1) 

provided that this series converges, which happens if 
|a;|>|a|. This means that x is numerically greater than 
a if both are real, and that the modulus of x exceeds that 
of a if both are complex. For a proof we may refer to 
any text on analysis. We may regard the statement 
just written as a geometrical progression summed to 
infinity or equally well as an example of the binomial 
expansion of ax~\l — 

Now take |a:| greater than the greatest among [a|, 
|^|, ...» |A|, expand each fraction, and add the results. 

Then 

B.{x) =f{x)l(l>{x) — {2a)x~'^-\-{IJaa)x~^-\-{I!aa^)x-^-\-..., (2) 

and this is the development of i2(a:) in descending powers of 
x, vah'd for sufficiently large values of \x\. 

Similarly, when |a:[< |a| we have 

ajix — a) = —a{ar^-\-a~^x-{-a~^x^-\-...). . (3) 

Hence for sufficiently small values of x, in fact when [a;| is 
less than each of |a|, |^|, ..., jA], we have the corresponding 
expansion in ascending powers of x ; 

R[x) =f{x)j<j){x) = — ScutT^ — {Saa~^)x — {Saa~^)x^ — ... . (4) 

Unless one or other of a, jS, .. ., A vanishes this expansion 
is possible. Therefore if a; is not a factor of ^{x) we may 
expand /(a;) /^(cc) in ascending or descending series of powers 
of X, for suitably small or large values of x. The same 
applies when <j){x) has repeated roots, where the work 
may be carried out on terms such as a{x—a)~'^ by the 
binomial theorem. If, however, x occurs as a factor in 
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cf){x), we remove the x before making an ascending expan- 
sion, and afterwards replace it. 

Example. = I {(2-a.)(l 

= x~^(2 — x-\-2x ^ — 

= 2x-^—x-^+2x-^ — l+2x—.., . 

Such a series, involving both ascending and descending 
powers of x, is called a Laurent expansion. Here there 
is necessarily a finite number of negative powers and an 
infinite number of positive powers. 

For a descending expansion there is no restriction on 
the denominator. In this example it is 

a?"*— ... . 


It is very useful to notice that the leading term in 
either expansion is given by the dominating terms of 
fix) and ^{x). In ascending series the dominating term 
is the lowest in powers of a; ; in descending it is the 
highest. 


Examples. 


1, 


*2— 3a: 4-2 


a 

a: — 1 


+ 


h 

x—2' 


where a = — 1, 6 = 2, and this on expansion gives 

- (a:-i +x-^ -)-a:-3 + . . . ) +2(x-'^ + 2x-^ -f 4x-^ -f . . . ) 

= a:-i4-3x-2 4-7a;-3-h...4-(2«-l).'i;-"-l-..., 

if a: >2 or < —2 when real ; or if |a:|>2 when x i.s complex. 
Again 

= ( 1 -}- “f-...) — (l“f" "4" "*h • • • ) 

= + + + + , 
if when X is real ; or if |‘'r|<-l when x is complex. 

2. Expand x^l{x^ — l) in ascending and also in descending 
powers of X. Is a Laurent expansion possible licro ? 



RECURRING SERIES 


53 


23. Recurring Series. When x is not a factor of 
any rational function R{x) may be expanded as above 
in powers of x. Suppose, therefore, that 

The sequence of coefficients p^, p^, ... is then called a 

recurring sequence or series. Such a series is characterized 
by a scale of relation, which means that, after a certain 
value Uq of n, each coefficient is formed in exactly the 
same way by a linear combination of its r immediate 
predecessors. 

A scale of order r = 1 is given by p^ = aPn-i^ where 
a is constant and w = 1, 2, 3, ... in succession. A scale 
of order 2 is given by p^ = ctPn-i-\'^Pn- 2 > where both 
a and b are constant : and so on. It is easy to prove that 
if ^{x) is of degree r the scale is of order r. The proof 
is left as an exercise for the reader, but is here illustrated 
when r = 2. 

Let = q{x) -jrr{x)l(f>{x) 

in the usual way : then q(x) is a polynomial which may 
interfere with the recurrence law for the first few terms, 
but r{x)l<l){x) gives the recurring series proper. We have, 
say, when x is suitably small, 

r{x)jf>{x) = {cx-^d)j{x^-\-gx-^h) 

= Uq-\-U^X-\-U^^-\~U^X^ 

where we assume that since <^(0)9^:0. Multiply 

throughout by x^-^-gx-^h and equate corresponding 
coefficients of aP, x^, x^, ... on both sides of the resulting 
identity. Thus 

d = Uf^h, 0 = b>-\-u^ ,'7+'Woj 

c = % 1i+Uq g, 0 = h+Uz !7+Mi, 

0 = S'+«n-2. 

Since this gives a recurrence relation for in terms 
of Un-x and where n = 2, 3, 4, .... 

Conversely, we may sum a given recurring series to n 
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terms or to infinity (i) if the scale of relation is given, 
or (ii) if sufficient terms are given so as to reveal a scale 
unambiguously . 

(i) Let 5 where Mq, 

are given, and also = a.Un-i-\-bUn- 2 . for w = 2, 3, ... . 
Multiply throughout by ax and also by 6a: 2 . Thus 

axs ~ augX -{-attjX • H-«M„_ia:” + . . . 
bx^s ~ bU(fc^-^...-\-bUn-z^'^+... . 

Hence by subtraction, carried out by powers of x, 

(1 — ax — bx^)s = — auQ)x, 

since every further coefficient of a power of x disappears 
owing to the scale of relation. Thus 

Uf^-\-{Uy—an^x 
1 — ax — 6a; ^ 

which is a rational proper fraction with a quadratic 
denominator (r = 2). To sum the series to n terms we 
proceed in the same way, but retain the non-vanishing 
terms involving x^'^^ and a;”''' 

The procedure is palpably a generalization of the well- 
known method of summing a geometrical progression : 
indeed the latter is the case of a recurring series for which 
1. 

(ii) Given r = 2 and the series l-\-Zx-{-lx^-\-15x^ 

to find the general term, the scale of relation, and the 
rational function s. 

Assume ; then 7 = 3a 4-6, 15 = la 

4-36. Hence a — 3, b ~ — 2. Now proceed as before : 

(l-3a:+2a:2)s = l+(3-3)a: = 1. 

Thus 

„ 1 12 
^ ” 1 -3x+2x^ ~ 1 -a- 1 ■~2x 

= l+3a:+7a;24-...-|-{2«+i~l)x”4-... . 

The general term is here shown. 
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Examples. The reader should take simple rational 
functions such as {2Sx)l(l — 6x + 6x‘^), should expand them 
to several terms in ascending powers of x (using either partial 
fractions or ordinary division), and should then try to recover 
the rational function from the recturing series by the methods 
outlined in the present section. 

1 . Determine the scale of relation and the coefficient of the 
term and the sum to infinity of the recurring series 

4=-^Sx-^2Sx^ — S0x^ + ... . 

For what values of x can it be summed to infinity ? 

2. Sum to infinity 24-orr-f-13a:;‘^+35a?^ + - •• • 

3. Discuss the series l + lO.r^-r 15a;^+21rr®+... . 

4. Show that 

l+o: cos d+x^ cos 2d-\-x^ cos 3^+... 

is a recurring series. So too is the corresponding series 
with each cosine replaced by a sine. 

[Un — 2un^i cos ^+Z^.^_ 2 = 0 ]. 

6. If |cj:|< 1 the sum to infinity of the above cosine 
series is 

(1— cc cos 0) I (1— 2rr cos d+x^). 

6. Discuss the hyperbolic cosine and sine series corre- 
sponding to (4). 

[Answers 1. ! — ( — 3)% 4/(l + 2a;— |:r|<J. 

2. {2 — 5x)j(l—5x+6x^), 3. r=S; 
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THE FUNDAMENTAL THEOREM OF ALGEBRA 

24. Statement of the Theorem. The basic theorem 
regarding algebraic equations may he stated as follows : 
Every equation 

f{z) = 2 + ...+«„ = 0 

in which the coefficients are arbitrary, real or complex 
numbers has at least one root z = a+ib, where a and b 
are real. 

The first satisfactory proof of this theorem was given 
by Gauss {Werke, iii. 1) in his dissertation : Demonstratio 
nova theorematis omnem functionem algebraicam unius 
variabilis in factores reales primi vel secundi gradus resolvi 
posse (Helmsted, 1799). Gauss incidentally criticized the 
earher defective proofs put forward by D’Alembert, 
Euler, and Lagrange. Gauss gave two further proofs 
in 1815 and 1816, but returned to his original method 
in 1849. A new proof was given by Cauchy {Cours 
d’analyse algdbrique, ch. x, 1821), and later by Sturm 
{Journ. de MatMmatique, i. 1836). 

Although this theorem is fundamental for algebra, its 
proof belongs to the theory of anal 3 rtic functions, which 
is a branch of analysis. The simplest proof runs as 
follows: f{z) vanishes if l//(s) is infinite. Now l/f{z) is 
an analytic function of z and, according to a theorem of 
Liouville, must either be a constant or else become infinite 
at one or more values of z. Hence f(z) vaiiisbes at least 
once. Lionville’s theorem allows this valium of z to bo 
either inftnito or finite. Now f{z)->oo if s— >oo ; hence the 

36 
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value in question can only be finite. This proves the 
theorem. 

A direct appeal to contour integratioir may be made 
which obviates the use of LiouviUe’s theorem. The 
reader will find the matter discussed in textbooks on 
the complex variable and function theory ; for example, 
MacRobert, Functions of a Complex Variable (1917), 
pp. 67-69. 

The original method of Gauss may be sketched as 
follows : Let z = x-{-iy 

and /(z) =f{x+iy) = u{x, y)-Viv{x, y), 

where the polynomial f{z) has been separated into two 
parts, real and imaginary. For instance, 

2 ®-|- 2 z +3 = {x-{-iy)^-{-2{x-{-iy)-\-^. 

Here u = x^-~Zxy^-{-2x-{-^, v = Zx^y~y^-\-2y. 

Both functions u and v are polynomials in x and y whose 
coefficients are real. Now suppose the curves u = 0, 
=: 0 to be drawn, for Cartesian coordinates x and y. 
If we can show that these curves intersect at a real finite 
point P = {x, y), then at this point both the polynomials 
u and V vanish simultaneously. Hence f{z) vanishes, 
and the point P represents a root z = of the original 

equation /(z) = 0. 

Now let a; = r cos 6, y = r sin 9. Then 
z — y(eos 6-\-i sin 6), 

and by Demoivre’s theorem z^ ~ r‘^{cos7i9-\-i smnd). 
Hence 

u = r”cos ^ == r”sin n6-\-r ”’~^ {...)-{-••• . 

The polar coordinate forms of the curves may therefore 
be written 

cos n6-{-r-'^X = 0, sin n6-\-r~^Y = 0, 
where X and Y are terms in cosines and sines of multiples 
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of 6, with powers of r in the denominators. If r is large 
we obtain cos nd — 0, sin = 0 as approximations to 
the curves, that is to say, radii from the origin at equal 
angles 2iTjn starting from 6 = for the cosine, and 
0 = 0 for the sine. If with centre at the origin O a circle 
r = J2 is drawn with a large radius R, and a regular polygon 
ABC... is inscribed, with 4?i vertices beginning at 0 = 0, 
the above results mean that, as the curves w = 0, 

V — 0 must approximate to the alternate radii OB, OD, . . . 


0 



for u, and OA, OC, ... for v. An asymptote of the u cmve 
woidd be parallel to OB and at a finite distance from OB. 
Thus it would cross tlie circumference ABC at a point Q 
such that the arc QB would remain finite while QA would 
tend to infinity, as OB = i2— >oo. Similarly for each vertex. 
Hence when R is large enough the u curve rims out of 
the circle near the n points ..., while the v curve 
runs out near the alternate points A,G, ... . 

Within the circle the curves are continuous, so that 
the 2ft points near 5, D, ... on the u curve must link up in 
pairs by means of n arcs, however complicated or twisted. 
Similarly for the v curve. It is topologically impossible 
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to draw such arcs, starting at alternate circumferential 
points for u and v, without at least one crossing of a it 
arc with a v arc. Such a crossing gives the required 
point P and proves the theorem. 

The above is a sketch of the proof. It obviously 
involves a knowledge of the algebraic curve, given by a 
real polynomial in x and y, and particularly the property 
that each arc of such a curve is continuous. The proof 
depends ultimately on the property that if Q, B, 8, T are 
four points in order on a circle, a continuous path from 
O to 8 within the circle must cross a continuous path from 
Pto T. 

25. The Product Form of an Algebraic Equation. 
Let us now return to the notation f(x) for a polynomial, 
where x may be real or complex. The fundamental 
theorem has shown that a root a of the n** degree equation 

f(x) = aoa;”+aia:”-i4-a2^”~®+-”4-a«_ia;+a„ = 0 

always exists ; that is, /(a) = 0. Now perform the 
division /(cc) — - (x — a). We may write as before 

f(x) = q(x)(x—a)+r(x). 

Since the divisor is linear the remainder is either a constant 
or zero. Thus r(x) = c, where c is a constant which 
may be zero. Now put a; = a in the identity, which 
becomes /(a) = c. Hence c vanishes with /(a), so that 
x—a is a factor of f(x), whenever a is a root of f(x) = 0. 
Prom the first step of the actual division we notice that 
q(x) is a polynomial with a^x^-^ for leading term. Hence 
f(x) takes the form 

f(x) = (a;— 

(Actually 6^ = aga-j-a^^, = aga"+aya-\-a^, etc.) By the 
fundamental theorem, q{x) = 0 also must have a root 
such that X — ^ is a factor of q{x). Hence 

f{x) = {x—a){x—^){agX^-^-i-CjX^-^-i-...-\-Cn^ 2 ). 
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The process may be repeated on the quotient until the 
jSnal quotient is a constant, in fact. This happens 
when n such linear factors have been segregated. Thus 
we shall have 

f{x) = (Iq{x a)(£C A), Uq^O, 

where the n numbers a, A are called the n roots 

of the equation f{x) = 0. We have reduced f{x) to its 
product form, or have resolved the polynomial into n 
linear factors. 

Example. f{x) = 2a;® — 5a;®— 9a; + 18. 

Here/(3) = 0, so that x — 3 is a factor. We have, by division 
or by Horner’s method, 

f(x) = (2a;®+a;-6)(a;-3) 

= (2a; — 3)(.a; + 2)(a;-3) 

= 2^a;-|^(a;+2)(a;-3) 

where the roots are —2, 3. We note that the leading 
coefficient, oSq = 2, survives as a factor in the final resolution. 

26. Repeated Factors. Equal Roots. It may happen 
that some of the numbers a, j8, ..., A are equal. If exactly 
r of them are equal to a then we say that a is a root repeated 
r times, or is an r-fold root. Clearly (a;— a)’’ is then a factor 
of f{x). Allowing for such repetitions we have now 
expressed /(a;) as 

fix) = ao{x—a)^x—^)^...{x—X)\ 

where = n. Hence the indices r, s, ..., t 

form a partition of n, and the simple or unrepeated case 
corresponds to the partition {11...1}. Obviously f{x) 
may have either one, two, ... or distinct linear factors : 
it cannot have more, for this would imply a leading term 
infix) of degree exceeding n. 



REPEATED ROOTS 


61 


Again, if B is any number distinct from a, j8, A then 
each of 6 — a, 6—^, ... is non-zero. Hence 

m = a^{d-ane-^Y...{e-\y^o, 

so that f{6)^0 and B is not a root of the equation. We 
have found n roots (allowing for repetitions among them) : 
we see that no other number is a root. Hence there are 
exactly n roots of the equation. 

This completes the result of 13, where it was proved 
by rational methods that the equation f{x) = 0 could not 
have more than n roots (p. 23). Apart from the order in 
which the factors occur the resolution of /(a;) into its factors 
is unique. 

The following theorem concerning an equation with 
repeated roots is important : When the n^h degree equation 
f(x) = 0 has exactly r roots equal to a then f(x) and its first, 
second, ..., (r— 1)^^ derivatives vanish at x = a, hut its 
r^ and following, up to the n^^ inclusive, do not. 

Proof. In this case we can write f{x) = {x—aYi}}{x) 
where Hence, by differentiation, 

f’{x) = r{x—aY-mx) + {x—aYilj'{x), 

80 that f'{x) contains the factor {x—aY~^- Its cofactor 
is rifj{x)-{-{x — a)iff'{x), and, when x — a, this is equal to 
rtfj{a) which is non-zero by hypothesis. Thus the first 
derivative contains the factor x — a exactly r — 1 times. 
The same method shows that /"(*) contains the factor 
r — 2 times : and so on until the (r — 1 )** derivative 
contains the factor once, and the is free from the 
factor. Thus the theorem is proved. 

Example, fix) = jb® — 4:X^-{-5x — 2, 

f'{x) = 3 . t 2 - 8 .' b - 1 - 6 , f"(x) = 6x— 8. 

Here fix) = (rB-l)%r-2),/'(:r) = (a,’-l)(3a:-5), and/(l) == 0, 
f'll) — 0, /"(I) # 0- The root 1 is repeated twice. 

Wo may noto that if, as herc,/'(£c) can readily be factorized, 
it is easy to find tlic repeated roots. The second zoro 5/3 
of f'lx) is of course not a root of fix) : nor is the single zero 
off" lx). 
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The geometrical interpretation of a repeated root is 
simple. If /'(a) = 0 the gradient of the graph is zero at 
X = a, and if also /(a) = 0 the graph meets the axis of 
X at the point (a, 0). Hence if a is a repeated root of 
the equation f(x) = 0 the graph touches the axis of x 
at (a, 0). It is worth while drawing the graphs of f, 
and /" in order to gain insight into the behaviour of / 
at a repeated zero. Furthermore, by Taylor’s theorem, 
we have 

/(®) «■) =/(a)+(^— a.)/'(a)+...+ ^ /^”>(a). 

If /(a) and its first r — 1 derivatives all vanish but not the 
r**, then (a; —a)’’ is a factor of every term in this series 
and therefore of f{x). Hence the converse of the above 
theorem is true. 

An important corollary of the theorem, having reference 
to the G.C.M. off{x) a,ndf'{x), is as follows : If f(x) contains 
the factor x — a exactly r times, then G(x), the O.C.M. of 
f(x) and f'(x), contains the same factor exactly r— 1 times, 
and G{x) is composed entirely of such repeated factors of f(x). 

' Proof. If /(a) and /'(a) both vanish then a:— a is a 
common factor off{x) and /'(as). Since it occurs to degree r 
infix), it occurs to degree r— 1 in/'(a:), so that 0{x) contains 
this factor to the power r— I also. 

Now 0{x) cannot contain a factor x~d unless both 
f{6) and/'(0) vanish. Hence 0{x) can only be 

(x — a) ’■~^(a; — . . (x — 
where a, j8, ..., A all differ. 

Again, if f{x) is any given polyiaomial, not necessarily 
factorized, we can find both /'(a:) and G{x) by the ordinary 
rational methods. Also we can write 

f{x) — G{x) h{x) 

where h{x), the cofactor of G{x) mf{x), can be ascertained 
by long division. Every zero of h{x) is manifestly a zero 
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off{x), but it is more interesting to note that, conversely, 
every zero of /(a?) is also a zero of h{x) and that h{x) has no 
repeated factor. Indeed 

(x-anx-^)^..{x-Xy 
G{x) “ (cc— A)*“^ 

= a)(a;— j8)...(a;— A). 

The equation h{x) = 0 is called the reduced unrepeated 
form of f{x) = 0. We note that if any index t is unity 
0{x) has no corresponding factor. 

In practice all repeated factors may be removed by 
this method, even if the separate factors x — a cannot 
easily be identified. If they can, so much the better. 

Examples. 1. Solve 

x’ —x^ — + — = 0. 

(The roots are 1, 1, 1, —1, —1, 3, — 3.) 

2. The reduced unrepeated form of 

x’’ —a;® +2a;®+2£C^ — 3a;®+3c»® — 4 

is -\-x+2){x — 1), There are three pairs of repeated roots 
and one unrepeated. 

27. Complex Roots of an Equation. Starting with 
the known fact that a quadratic with real coefficients may 
fail to have real roots, we can easily construct an equation 
of the degree with less than n real roots : thus 
{a;2+l)(a;®+aj+l) = 0 is a quartic equation with no 
real roots ; and (a:— = 0 is a quintic with only 
one real root. We must therefore be prepared to consider 
complex roots of a real equation 

f{x) = -\-an = 0, 

where each coefficient , is real. Incidentally, we may 
merge the case of complex equations, where the are 
complex, in that of the present case where the coefficients 
are real, by the method of 5, Ex. 2, p. 11. The roots of a 
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complex equation of degree m may be regarded as roots of 
a real equation of, at most, degree 2m. 

Example, — = 0 is a complex quadratic whose 

roots are included among the three roots of the real cubic 

3j3_gj2_j_2 = 0. 

Let us then confine ourselves to real equations. If 
is any complex number we have 

/(a+i^) = A-\-iB 

where a, jS, A and B are real. This result is obtained by 
a straightforward reduction due to replacing by — 1 
whenever it occurs. Since {—i)^ — —1 we should obtain 
in exactly the same way 

f{a-i^) = A-iB 
where A and B are the same as before. 

Example. 

a(aH-ij8)^+6(a4''^i8)4-c = a(a ^ — 

a(a — i^)-+b(a—'i^)+c = a(a^ — ~hba+c—i(2aaj3-l-bj3). 

N’ow suppose that a+*j8 is a root of f(x} — 0, so that 
A -i-iB — 0. Since A and B are real, this is only possible 
when A = 0, B = 0. Hence A — iB == 0, so that 
/(a— ijS) = 0, that is, a—i^ is also a root. Hence we 
have proved the following property ; 

All complex roots of a real equation occur in pairs, 
such as a^ijS, which are conjugate complex numbers: the 
number of complex roots of a real equation is therefore even. 

Since {x-—a—i^){x—a-\-i^) — (a;— which is real 
and positive whenever x is real, it follows that every real 
polynomial may be expressed in the form 

fix) = ao{x—a)...{x^-\-px+q)..- 
— aon(a;— a)n(a;2+i?*+9')j 

where n indicates a product of typical factors, and in 
each quadratic factor p^<4:q. Tliis is a resolution into 
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real factors, linear for each root a oif{x) — 0, and quadratic 
for each pair of conjugate complex roots. Of course 
there may be repetitions among the roots, but if a complex 
root is repeated, so is its conjugate, and so therefore is 
the real quadratic factor 

In terms of real numbers we can now say that every 
real polynomial can be factorized into real linear or 
irresoluble quadratic factors, and that such a factorization 
is unique. 

Examples. 1. Find the real factors of 
03^ + 1, 03^ — 16x^+4033 — 25. 

2. Solve the equation 27x^— 45x2 +26x— 4 ^ given that 
it has a repeated root. 

3. Find the reduced unrepeated form of the following 
equations : 

(i) — 18x3 + 69x2+108x + 36 = 0. 

(ii) x^-10x2 + 37x2-60x + 36 = 0. 

(Each equation has two pairs of repeated roots.) 

4. Solve the equation Sx^+lGx^— 64x+64 == 0, which 
has two roots of the type 

5. The equation x^ — 4x^+7x2+6 =0 has no real roots. 

(Consider (x— 1)^ + (x+2)2 + l.) 

6. Find four real roots of x®— 47x^+l =0. 

7. Solve (x — l)"^ = x’ — 1, which has two repeated roots. 

[Answers : 

2. Put 3 x==2/; h -i(l±Vl^)- 

+2-A^X2i, 4 + 2 2 . 

±4(3±'\/5).] 



OH APTE E V 

PROPERTIES OF THE COEFFICIENTS OF AN 
ALGEBRAIC EQUATION 

28. The Elementary Symmetric Functions. Let a, 

A be the % roots of the equation 

Then f{x) = aQ{x—a){x—^)...{x~X). . . (2) 

Equating these two expressions we have an identity which 
is true for all values of x ; hence we may multiply out the 
bracket factors and equate coefficients of corresponding 
powers of x. Thus 


= 

(IqSo, = 

= a(j(a^+ay+--) = (3) 

ao^'a^y = ao(<^^r+--*) = ~ % 


aQa^...X = (-)X> 

or Ua = — Uj/Uq, ZajS = U 2 /^oj etc. These 

relations express the coefficients in terms of the leading 
coefficient and of the elementary symmetric functions of 
the roots (Girard, 1629 ; Harriot, 1631 ; Vieta, 1646). 
The elementary symmetric function is the sum 2 of 
the combinations of the n letters a, j8, ..., A taken m at a 
time, so that S has n^^) terms, where 


n. 


(m) 


n\ 

m,\{n—m)\ 


n{n~\)...{n--m-{-\) 


1 . 2 . ... w 



ELEMENTARY SYMMETRIC FUNCTIONS 


67 


the well-known binomial coefficient. We may refer to 
these sums as the sums of m-ary products of the roots. 

Examples. 

For raa;®-4-26a;-f c, Ea = a+yS = — 2&/a, a^S = cja. 

For +a-jpc^ Ea = a+^+y = —aja^, 

Ea^ = a/S-f'O.y + jSy = a^y — — Cb^jaQ. 

For a quartic (biquadratic) Ea^ has six terms and 
Ea^y has four. It is important to become thoroughly 
familiar with these relations (3) for the cubic and quartic 
cases. Note that occurs in each denominator of a 
sum of m-ary products, and that the suffix in the 
numerator determines both the sign and the number 
of factors in each term of the series. This number is 
called the weight of the term and of the series and of the 
eoefficient. 

In particular cases care is needed, since the notation 
E is devised to help only in the general case when all the 
roots are distinct. For instance, if f{x) — {x—a)^{x—b), 
theni7a = a-\-a—b = 2a~b,Ea^ — a^—2ab, a^y = —a^b. 

The relations are of immediate help in the solution of 
an equation whenever the roots are rational, or integral, 
or satisfy simple relations, as when they are in arithmetical 
progression. 

Examples. 1. If all the roots and coefficients are integers 
Uq must be a factor of each and each root is a factor of a„. 

Solve — 6x^ + 1 1® — 6 = 0 by trial. 

2. Solve flj® — 5x^ — 58a; = 88. 

3. Solve a:®— 3a.'i;* + (3a®— a(a®— &®) = 0, having given 
that the roots are in arithmetical progression. 

4. Solve 8a;® — 52a:® 4- 78a;— 27 = 0, given that the roots are 
in geometrical progression. 

5. Solve a;*-|-a;® + 7a;®-|-4a:-|-12 = 0, given that one of the 
roots is a pure imaginary. 

(Take®, 6, ffiic as roots. The roots are ±2t, — ^(Izb^-v/lL) 
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29. Allied Equations and Polynomials. It is an 
immediate consequence of the jn-ary relations of 28 (3) that 

= aQ{x—ay){x—^y)...{x—Xy) ( 1 ) 

This is called the homogeneous form of the identity due 
to factorizing the poljmomial f{x) . Other variants of the 
same theme are obtained by putting y = —1, thus, 

ao*” +a2a3”- 2 — ...+(— )”a„ 

= aQ{x-\-a)(x+^)...{x-{-X), . . . (2) 

or again by writing 1 for x and x for y : 

«() +aia; q-aga: ® + . • • 

= aQ{l—ax){l~^x)...{l~Xx). . . (3) 

The zeros of this reversed poljmomial are a“^, ..., 

X~^, that is, the reciprocals of the roots oif{x) = 0. 

Zero and Infinite Roots of an Equation. The 
equation f{x) = 0 has a zero root if, and only if, = 0, 
as is obvious. When the last r coefficients vanish but 
the coefficient does not, tiion there are manifestly r 
repeated zero roots. In this case the reversed equation 
just written down appears to have r infinite roots, since 
the root a of the original equation corresponds to or^ 
in the reversed equation. Similarly, when the first r 
coefficients of /(cc) vanish biit does not, we might 
consider /(x) = 0 to have r infinite roots. This is, however, 
a sophisticated way of viewing the situation, since the 
equation is no longer of the n^^ degree but of the (n— r)*'* 
degree. 

It is, however, more fruitful to consider a limiting case, 
and first to ask what is the effect on the roots if the 
coefficients vary slightly ? To answer this, let us consider 
the graph of /(x) when each coefficient is numerically less 
than e, supposed small. As on p. 18, we can take e small 
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enough to ensure that f{x) will be less than any assigned 
small quantity. Let the ordinate of such a curve be 
called rj, so that rj is small. Now take any finite values 
of the tti and alter them by adding these small coefficients. 
The resulting graph will differ vertically at (x, y) to an 
extent iq, which may be positive or negative. Hence it 
must cross the axis of x at points which are near to the 
crossings of f{x) on the axis : that is, the real roots of 
f{x) = 0 are altered slightly when the coefficients are so 
altered. Each real root is in fact a continuous function 
of the coefficients : so too is a complex root, but we shall 
not attempt to prove this. 



Suppose that but that aQ—^O, while aj-^a:^0, 

an->b^O simultaneously. Let us consider the effect on 
the reversed equation 

-j-a„_iX ^-^ + . . . ^a^x +ao = 0 . 

It wiU tend to a form for which one root is zero and the 
rest are non-zero. If one root a only of this equation 
tends to zero, then the sum of (tc— l)-ary products of roots 
contains one term ^y... which remains finite, while the 
rest of the terms contain the factor a and consequently 
tend to zero. If two roots tend to zero, every term of 
the (?i— -l)-ary product sum tends to zero. Hence %-»0, 
which contradicts the supposition. Accordingly if Oq— > 0 
but % does not tend to zero, then one root and one only 
of the reversed equation tends to zero, and one and one 
only of the original equation tends to infinity. 

Example. Solve ea:2-]-2a;— 3 = 0. 
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30. Further Symmetric Functions of the Roots. 

A second set of important symmetric functions of the 
roots of f{x) = 0 may be introduced as follows : 

Definition . — ^The coefficient of a:’" in the expansion 

«o(ao + • • • 

■ ( 1 ) 

is called the homogeneous product symmetric function of 
order r in the roots of the equation 

f{x) = aoa:”+aia;"-i+...+a„_iX+a„ = 0. 

We can in fact write the left-hand expression in (1) as 

a^o{«o(l —ax){l — /3x). . .}-! 

or {l+ax^a^x^-\-...){l^ fix +^^x^ + . (2) 

provided that each of aa;, fix, ... is numerically less than 
unity, which is true for all small enough values of x. In 
which case by equating coefficients we have 

^0 = 1,^ = 2Ja, ^2 = Da^-\Safi, — Sa^ -\-IJa^fi -{-Dafiy, 

and so on. Obviously h,, is the sum of all distinct terms 
composed of exactly r factors a, fi, ..., allowing repetitions. 
(These symmetric jfunctions are sometimes referred to as 
sums of m-ary powers and products, sometimes as aleph 
functions of the roots, a name originally given by Wronski). 
The summations involve more terms, and are more 
complicated than those of the elementary symmetric 
functions, which we shall now denote by eg, and so on. 
But the e and the Ji functions are equally important, and 
between them there runs a close parallelism. (See for 
example, Aitken, Determi-nants and Matrices, pp. 113-121, 
where the elementary symmetric functions are denoted 
by a^.) 

The number ,^1.^ of terms in the summations Ti^ is 
given by putting a — fi == ... = X = 1, since each term 
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then reduces to unity. But this gives, instead of the 
identity ( 1 ), 

By the binomial theorem we find that 


= {n+} —l)^r) — j — 2 — 3 


wliich therefore gives the number of ways of choosing 
r things from n things when repetitions are allowed. 


31. Relations between tbe e and the h Functions. 

From 29 (3) we have at once 

F(x) = (1— aa;)(l — ^x)...(l—Xx) 

= l—e^x+e^x^— 

while 

For convenience we may take Cq = 1, = 1 . Here we 

notice that the e-series is finite while the A-series is infinite. 
On multiplying them together we have the identity 

1 == F{x){F{x)}-^ = (1— eia3+...)(l+Aia:-f ...), 

so that the coefficient of each positive power in the 
expansion must vanish. Accordingly we have 

— ^1 

^2 — = Oj , . , . (1) 

63 

relations which are called Wronski’s relations (Aitken, 
Determinants and Matrices, p. 114). Solving from these 
each way in turn, we have 

H = Af— 7^2, Ag = ef— Cg, 

eg = A|— 2AiAa+A3, Ag = 26162 + 63 , 


( 2 ) 
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and so on. Hence either set may be expressed rationally 
and integrally in terms of the other ; and the inter- 
changeable rdles of the e’s and the h’a in such formulae 
cannot fail to be noticed. 

Example. Prove that 

®0®r == ( — ®0^r + • • • +®v^0 

32. The Sums of Powers Symmetric Functions. 
A third set of symmetric functions of the roots is the 
sums of powers : 

== ScL = • • “t"^j 


It is possible to express each of these sums rationally 
and integrally in terms of the elementary symmetric 
functions e^. The results are embodied in Newton’s 
formulae (1707), which run as follows ; 

UoSi+ai = 0, 

• • • • ( 2 ) 




where of course = 0 when p exceeds n. To establish 
these relations we calculate f\x) in terms both of the 
coefficients and of the roots, and then equate corresponding 
terms in powers of x. Now 


f'{x) = moa:"-^+(TO— 


Also f{x) = 


f{^) , f{^) 


X- 


4- ' 

-a X — t- 




fix) 

x—X 


( 3 ) 


as is at once seeir by differentiating the product form of 
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f{x) in 25 fp. 60). Again, by ordinary long division we 
have 

X — a 

without remainder. The process yields an apparent 
remainder /(a) which of course vanishes. (The quotient 
just written down is indeed an alternative way of writing 

aQ{x—^){z —y ) ... (a; — A) 

which would result from dividing the factorized form of 
f{x) by x—a.) Let a series similar to (4) be formed for 
each root in turn. This gives n such series, and their sum 
wiU therefore be equal to f'(z), so that 

/' (x) = + (tto®! +naj)x'^~ ^ + {UgSg +aiSi +^ 2 ) 2 ?”“®+ .... 

For example, when n — 3, the three terms involving 
a.n -2 giye a^a+cq+aojS+aj+aoy+ai for coefficient, which 
is a^Si+Soq. Similarly for the other terms. 

On comparing coefficients in the original and in this 
final form of/' (a:) we have 

(%—!)% = aQS^-^-na^, (w— 2 )a 2 =aQS^-\-a^s^-{-na^, etc. 

whence the first n of Newton’s formulae at once follow. 

Again, let F{x) — where p 

is a positive integer greater than n. Since x'^-'^ is a 
factor, and f{x) is its cofactor, we mfer that the roots of 
F{x) = 0 are a, j8, ..., A together with p—n zeros. Hence 
the sums of powers of its roots are the same as for j{x). 
By writing out the p^^ formula in Newton’s relations for 
F{x) according to the above method we obtain 

But all the coefficients subsequent to a„ are zero, so that 

= Oj • • (5) 

which is the form of Newton’s relations for p >n. 
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The following alternative form of Newton’s relations is 
easily deduced, since = {—Ya.^ : 

-Cl = 0, whence Sj = gj, 

^2 = 0, 6'2 = ef — 2e 

53— eiS 2 +e 2 ' 5 i— Scg = 0 , 53 = ef — SeiCgT-Sca, 


This shows that each of the s-functions may be expressed 
rationally and integrally in terms of the m-ary product 
sums of elementary symmetric functions of the roots. 
These equations were first given by J. Gregory in 1675 
for the case of the septimic equation {n = 7) as far as s^. 

The first set of relations in (6) between the s and the e 
functions is linear in both sets of variables, and may 
accordingly be solved for either set in terms of the other. 
When this is done determinantally we find that 


St} — 


% 

^1. 

262 

1 

ei 


1 

■®2 

1 


2eo 


1 

1 

^ 2 

1 

■Si 

2 

SCa 

62 

% 

3! 


X 


ei 

1 



h 

1 


2^2 

e 

1 

1 

<?2 

^1 

2 

3e, 

^2 

61 1 

li 

50 


■*2 

■Sl 

' 464 

63 

^■2 64 1 

1 


«3 



and so on. 

If we apply the above reasoning to the equation 


( 7 ) 


a:”-faia:”-i+a2a;”-2+...+a^ = 0 . (8) 

where = 1, we have at once Za^ = s^ — a\—2a^. 
If all the roots are real each is positive, so that 
Therefore if aj<; 2 a 2 — 2^2 there must be some complex 

roots. This proves the following result ; 
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The equation (8) has at least one pair of conjugate 
complex roots whenever 

(1) af<2a2, (2) af = 2a2, (3) Oi = 0, a2>0, 

(4) = ag = 0. 

It by no means follows that when a|>2a2 all the roots 
are real, as the case x'^-rQx-\-\l = 0 shows. Here 36>22 
but the roots are complex. 

33. Symmetric Functions in General. Any function 
of n arguments a, jS, ..., A which is unchanged by any 
interchange among the arguments is called symmetric. 
For example, x^y-\-xy^, sin(a;'-j-y), {x—y)^ are symmetric 
functions of x and y, but the function x—y is not, since it 
differs from y—x. We have already considered several 
symmetric functions of the roots of an equation, and 
the fact that the coefficients of 

are instances of such functions suggests their importance. 
Actually they play a fundamental part in many branches 
of higher mathematics, beginning with the theoretical 
solution of a cubic or higher equation. 

Fundamental Theorem on Symmetric Functions. Every 
rational integral symmetric function of the n arguments 
a, j8, . . . , A can be expressed as a rational integral function 
of the n elementary symmetric functions %, e^, e„. 

Proof. This follows by induction. Every such 
symmetric function >S' is a sum of terms ca’‘jS®...A*, where 
c is a constant. Select from 8 a term of lowest degree 
in the n arguments, that is, one for which the sum 
r4-5+...+i is least. Collect together all the terms which 
are symmetrical with it and necessarily have the same 
coefficient c^, calling the sum of these CyS.^^. Since 8 is 
symmetrical and also, then manifestly S—c^Si is so too. 
Repeat the process on this by collecting out of it a 
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symmetrical set : and continue until the whole 
expression 8 is exhausted. This expresses 8 as a, finite 
sum of monomial symmetric functions such as 8i. Thus 

8 = CiySfi+C 2 /Sf 2 +...+Cj,-Sft, where 8^^ = Ua‘^^‘ ..A*. 

For example, when 

S = x^~3xy-\-x^y-}-y^+2x-\-2y+xy^, 

8i = x+y, = xy, = x^y-\-xy^, = x^+y^. 

Among the terms such as 8^ several may have the same 
degree p, as 8^, 8^ in the above example. When this 
happens we arrange these terms in ascending lexical order 
of the partition belonging to the positive integer p, 

lower indices taking precedence over higher (see p. 2) 
and Corresponding to each partition there 

is one monomial symmetric function, and vice versa. 

If we can prove the theorem for a monomial symmetric 
function of any degree p and any set of indices 
it must then be true for every 8. We prove it for 
monomials by induction, assuming it true for all monomials 
of lower degree than p, and for those with an equal p but 
an earlier partition in the lexical order. 

Write a''jS*...A* = (ajS...A)''(j8y...A)®~''... = T, which can 
be done since we have assumed, without loss of generality, 
that 

For example, — (a/5yS)®(j8yS)®S®. 

Let the elementary symmetric functions to which these 
groupings of the n arguments belong be e„, e„_i, etc. 
Then T is a term in the expansion of 

in terms of the arguments a, jS, ..., A. 

For example, a®/3®y®S^ is a term in 
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But since J/ is a symmetric function it too may be expressed 
monomially in lexical order 

where is necessarily the monomial due to the set of 
indices (In the example jSj consists of terms of 

indices 2, 5, 5, 7 only.) Hence E-^ — S-^ and c\^Q. On 
dividing by the non-zero constant c\ we express E-^ or 8-^ 
rationally and integrally in terms of E and earlier 
monomials E^, E^, Ej,. for which the theorem has been 
assumed true. Since E is itself in the required form this 
proves the theorem by induction. The reduction is unique 
(p. 148). 

34. Further Theory of Symmetric Functions. 
Taking the case of the third order, let 

Cl = a+^+y, eg = ^y+ya+a^, = a^y, 

so that a, /S, y are the roots of the cubic 

— egOJ+^s. 

Multiply this by x and substitute for x® in terms of lower 
powers : then 

cc* = e]^x^—e2X^-j-esX 

= (ef— Cglx^— (e^eg— e 3 )x-i-eie 3 . 

Similarly, x® is expressible in terms of x®, x®, x and therefore 
of x^, x^, X® ; and so on. We infer that, if x = a or yS 
or y, we can always express a power of x as a quadratic 

x*’ = E^x^-i-E^x-hEs, X = a, y, . (1) 

where E^, E^, E^ are pol 5 momials in e^, Cg, eg with integral 
coefficients. 

Now take any three positive integers I, m, n and form 
the determinant 

y' 

a™ y™ 

a" y" 


A = I a^/3“y” I = 
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We can therefore write this as 

A= F^^^+F^^+F^ FiY^W^y^-F^ 

G^a^-{-G^a+0^ G-^^-{~Gz^-\-G^ G-jy^-\-G^y-\-G^ 

E^ E^ Es 2 ^2 y2 

F^F,F, ^ y ^A{e)A{a^y), 

G^ G,G^ 11 

let us say, by the multiplication theorem of determinants 
(Aitken, Determinants and Matrices, p. 80), where the 
nine quantities E, F, G are polynomials in e^, eg, eg with 
integer coefficients, and hence A{e) is so too. 

If a, |S, y all differ the cofactor determinant 
is the difference-product (a— j8)(a— y)()3— y), and the 
quotient 

|a^j8”*y"| -y |a®y8^y°| 

is identically equal to id(e) which is a symmetric function 
of a, j8, y. We caU this quotient a bialtemant {Ibid., 
p. 113) of order three. Exactly similar treatment applies 
to any number of roots a, /3, A. A bialternant is 

characterized by its arguments a, y, ... and its positive 
integral indices l,m,n, .... Thus 

Every biodte'rnant is expi'essible as a polynomial in tlie 
elementary symmetric functions ej with integer coefficients. 

Next, a function ^(a, y) which changes sign whenever 
any two of its arguments a, )S, y are interchanged is called 
an alternating function of its arguments ; and similarly 
for any number of arguments. Eor instance, 

^(a, y) = — a, y) = y, a) = etc. 

Hence the square of an alternating function is a symmetric 
function ; so also is the product or the quotient of two 
alternating functions. But if i^(a, j8, y) is symmetric, 
the product ftifs alternates ; in fact 

^(a, y) ^(a, jS, y) = a, y) a, y). 
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The alternant |a^/S^y®| is an alternating function, and 
so is the more general |a^ j. Their quotient, which 
is a bialternant, is then symmetric. 

Alternative Proof of the Fiandamental Theorem 
for Symmetric Functions. The above considerations 
lead to a notable proof of the theorem that every monomial 
symmetric function of a, y, ... is a polynomial, with 
integer coefficients, in e^, e^, e^, ... . 

For let Za®/S“y’' — tfs he such a monomial symmetric 
function. Multiply it by 

^ = [a^yS’-y®] = 27d:a®jQ^y®. 

Then cf>ip = 

where the whole summation extends to six terms due to 
permuting a, j8, y among themselves, and again to terms 
due to permuting p, q, r. Hence 

cf>4t = la»+2^«+V| + ja«+2^i’+Vl +■•• • 

Divide throughout by ^ = la^^^y®!, and this at once 
expresses ^ as a sum of bialtemants such as 

each of which we have seen to be equal to a pol3momial 
in the elementary symmetric functions e and with integer 
coefficients. 

The above method is general and applicable to n 
variables a. If ^ = Z'a®^^y’'...A® is a monomial symmetric 
fmretion where p-i-q-\-r-h...-\-s = n and some of the 
indices may be zero, the number of terms in this summation 
is the number of distinct permutations of the n letters 
p, q, s taken all at a time. 

An Alternative Based on Cauchy’s Proof. Let us 
segregate one root a and let = a 4-/1, — o/i+Za, 

63 = 0/2 4-/3, ..., e„ = afn-i, 80 that the expressions 

A = ^4-y+---> /a = •••> fn-i = ^y...A 
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are the elementary symmetric functions of the n~l 
remaining roots. If i/f is a monomial symmetric function 
of the n roots, arrange it as 

where each ifsi is a function of the n—l roots, in which it 
is necessarily symmetric. Assume the theorem true for 
these n—l roots, so that each tjs is expressible as a function 
of the fi. But 

= e^— a, /a = Co— a/i — eg— and so on. 

Hence each if) is also a poljmomial function of the and 
of a ; let us say finally that 

where each E is a polynomial in the ej-, and by (1) p. 77 no 
power a® higher than is needed. 

By symmetry exactly the same equation is satisfied 
by y, . . . , A, so that 

ij/ = EQ-^Ei^-\-E^^-\-...-{-En-i^”'~^, etc. 

Either all of E^—tp, E^, E^, ..., vanish, or else by 
elimination |a^/3^y2...A”| = 0, which cannot be true since 
this determinant is equal to the difference-product of 
arguments a, ..., A all different. Hence iIj = Eq and 
all the other E^ vanish. This expresses ip as a polynomial 
in the 6^, and the rest follows by induction. 

Examples : 

1. Evaluate s^, Sg, for jc3_5a;-f.i=0. 

2. Evaluate I!a~^ for 4ai*— 3a:+l=0. 

3. Prove that 


•^0 *^1 *^2 *^3 


1111 



a jS y 8 

^2 '^3 '^4 ^5 


a “ jS " y 2 8 ‘^ 

'^6 '^6 


a ? jS * y => 8 ^ 
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TRANSFORMATION AND NUMERICAL SOLUTION 
OF ALGEBRAIC EQUATIONS 

35. Increasing or Decreasing all the Roots of an 
Equation by the Same Amount. Given an equation 

/(a;) = = 0 . (1) 

we shall seek for the equation 

• ( 2 ) 

each of whose roots is h less than the corresponding root 
of f[x) = 0. In other words, we transform J{x) to F{y) 
by taking 

X = y-Yh. 

In the factorized form we shall then have 
f{x) = aQ{x~a)[x~^)...[x—^) 

= m- 

The roots of the equation ^'(^) = 0 are clearly a—h, ^—h, 
and so on, and these are of the required form ; and so we 
have found the required polynomial F{y), only in factorized 
shape. Without yet knowing the roots for x we can 
obtain F{y) directly by writing 

f{x) = ao(y+?i)"+ai(y+fe)”-^..+a„, . (3) 

expanding each term and then rearranging. In practice 
this is done by Horner’s method (16) p. 32. 

81 
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Examples. 1. To reduce each root of 



2a:3+3a; 

-4 = 

0 by 3. 

-2 

0 

3 

-4 (3 

3 

3 

9 

36 

1 

3 

12 


3 

12 

45 


4 

Is 

57 


3 

21 



7 

136 


X — y 


3 I 

10 

+ 102/3 +361/2+572/ + 32 = 0. 

2. Increase the roots of a/*‘+4a;3 — 19.i;2 — 106a; — 120 = 0 
by 3. Hence solve the equation: (i/* — Si/®— i/^-j-g?/ = 0). 

3. Decrease the roots of a;3+2a; — 2 = 0 by 0*7. 

(i/3+2-li/2+3-47i/-0-257 = 0). 

36. Removal of the Second or Third Term of an 
Equation. If the relation 35 (3) is expanded, the co- 
efl&cients of i/" and the next two highest powers of y are 
respectively 

^n{n—l)aQh^~\-{n—\)a^-\~a^ . (1) 

If we choose h = —ajna^ the second term in the resulting 
equation for y disappears. If instead we choose h to 
make the quadratic expression in n, the coefficient of 
vanish, then the third term of the equation for y 
disappears. Usually the same value of h fails to make 
both terms disappear and we content ourselves with the 
first step only. Since is rational the step is readily 

taken, and the resulting equation is manifestly simpler 
than the original. 
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Examples. Remove the second term from the eqiiations 

a;3_j_6a;2_7a;_4 = o, 2x^+x^-l = 0, a;«-8a;3+a:2~x4-3 = 0. 

1 1 53 

(Here h = —2, — 2 ; «/® — 19^+26 = 0, 2y^—-y—— = 0, 
6 o 54 

yi-23y^-61y-4cB = 0). 

Increasing or Decreasing all the Roots by a 
Constant Multiple. Let x — hy ] then 

f{x) = 

This is of special numerical use when k is a power of 10. 
For example, we can regard the equations 

a:H2- la: H3- 47a; -0*257 = 0, 
a;H21a;2+347a:-257 = 0, 
xH0*21a:H0'0347a:-0-000257 = 0 

as essentially the same. If the roots of the first are 
a, j8, y those of the second are 10a, 10;8, lOy, and of the 
third are a/10, j8/10, y/10. 

37. Horner’s Method of Solving an Equation. 

An equation may always be solved to any desired degree 
of approximation, as far as its real roots are concerned, 
by Horner’s method. This is best explained by an actual 
example : we shall take/(x) = x®—4x^-t-6x— 18984 = 0. 

Here /(O) is negative, so that the graph of f{x) crosses 
the axis x = 0 below the origin, but/(100) is positive, and 
the graph crosses the parallel x = 100 above the axis 
y = 0. Since the graph is continuous it must cut y = 0 
at least once between x = 0 and x = 100. By trial we 
find that /(20)<0, /(30)>0, so that a root lies between 
20 and 30. Now reduce all roots by 20. This gives an 
equation +56?/ ^4-10461/ = 12464 which must therefore 
have a real root between 0 and 10. Reduce the roots 
by 8. (The choice of 8 will be explained presently.) 
The new equation is 2^+802^+21343 = 0, where the 
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balance in the final column of the Horner scheme is zero. 
Accordingly z = 0 solves it : and a: = 20+8 = 28. 


-4 

16 

36 

66 

_8 

64 

_8 

72 

8 

M 


6 

320 

326 

720 

1046 

512 

1558 

676 

2134 


-18984 (28 
6520 
—12464 
12464 

I 0 


0 = 2 = ?/— 8 : 


The actual equation for z is 2®+8022+21342 = 0, which 
shows that 2 = 0, or else must be complex since 
802<4(2134). So the roots for x, being 28 greater than 
for 2 , are 28 itself and two complex values also. 

How, it will be asked, was the figure 8 chosen for the 
reduction oi y% Had 9 been chosen the balance in the 
third column would have been increased by more than 
612 and that of the fourth would have become positive. 
This last fact shows that /(20+9)>0 and therefore 29 
is too large a number. Also 7 would have been too small, 
for, except at the initial step, we always choose the highest 
digit which will reduce the numerical balance in the last 
column as much as possible without changing its original 
sign, from — to + or from + to — . 

Notice that in the working arrangement, as exemplified 
above, the successive reductions can be carried out as 
successive continuations of the first one. 

More often than not the root is a non-terminating 
decimal. In such a ease the balance in the final colu m n 
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is never zero. The following worked example is chosen 
for comparison with the preceding. 


Example. 


Solve a:3—4a:®+6.r— 20,000 — 0. 


-4 6 -20000 (28-4 

20 320 6520 


16 

20 

6F“ 

8 

_8 

72 

8 

8^0 

0-4 

80- 4 
0-4 

8 ^ 

0-4 

81- 2 


326 

720 

1046 

512 

1558 

576 

2134-00 

32-16 

2166-i6 

32-32 

2198-48 


— 13480 
12464 
-1W6-000 
866-464 
1-149-536 


The method is the same as before, but at the third stage 
a reduction of the root by 0-4 is effected (trial shows that 
0-5 is too large : it would make a i^ositive final balance). 
Since the operation is persistently that of adding 0-4 times 
the balance, it is hardly surprising to note that the balances 
in the second and third columns are altering relatively little. 
The change is less at each successive stage : at the next, 
with 0-06, we should have 


81-20 

2198-4800 

0-06 

4-8756 

81-26 

2203-3556 

0-06 

4-8792 

81-32 

2208-2348 

0-06 



-149-536000 (0-06 
132-201336 
-17-334664 


81-38 
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This leaves a negative balance in the final column ; 0'07 
would have given a positive. The next approximation is 
0-007 which adds on less than 0-6 in the third column. The 
fourth decimal figure in the answer would add on even less, 
but the number of significant digits in the balances is becoming 
unwieldy. Let us therefore redress the balances by deleting 
digits from the right — one from the last column but one, 
two from the last but two, three from the last but three, and 
so on. We then have 


81-38 


$1 


2208-2348 

0-569 

2208- 804 
0-569 

2209- 373 
0-06 

2209-43 

0-06 

2209-49 


-17-334664 

15-461628 

-1-873036 

1-767544 

-0-105492 

88380 

-0-017112 

15466 

1646 

1547 


99 

11 

n 

0 


X = 28-467847745... . 


Considering the precision of the result, which gives 
X to ten significant figures, the labour involved is by- 
no means unreasonable. We may note a few general 
principles : 

(i) After the first and perhaps the second digit has 
been found the remaining figures are successively given 
-with comparative ease. Each new figure is roughly given 
on dividing the balance in the final column by that in 
the next preceding column. (For example, the third 
significant fig-ure 4 in the illustration above is suggested 
by 1016000/213400.) But we must allow for a possible 
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increase in sucli a divisor ; as in this case 213400 rises 
to 216616. 

(ii) The decimal point may be omitted except in the 
answer. Instead, at each new stage the balances may 
be multiplied by 1, 10, 100, 1000 and so on, from left to 
right. Thus we attach one, two, three ciphers to the 
balance in columns two, three, four and treat them all 
as whole numbers. This is illustrated in the example 
which now follows. 

Example, a;®— 0‘4cc®+0’06tB— 20 = 0. 


-0*4 

0-06 

-20 (2-846785 

2 

3-2 

6-52 

re 

3-26 

-13-48 

2 

7-2 

-13480* 

3-6 

10-46 

12464 

2 

1046 

-1016000* 

6-6 

512 

866464 

*56' 

1558 

-149536 

8 

576 

132204 


213400 

— 17332 

8 

3216 

; 15456 

l2 ' 

”216616 

' -1876 

8 

3232 

1766 

*800 

1 219848 

-no 


4 49 110 

8^ 22034 


4 49 

808 22083 (* As in (ii) above.) 

4 

8Z2 ' 

(iii) Contraction may alternatively be employed, by 
deleting figures as already explained. It uses up the 
balances and produces a result to about twice as many 
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significaat figures as were reached at the stage when 
contraction began. 

(iv) At the 7 of the third decimal place in a; on p, 86 
the balances were 1, 81-38, 2208-2348, etc. Contraction 
gave 81, 2208-234, but 569 was entered as the approxima- 
tion better than 81x7=567. It was obtained by paying 
attention to the figures 38 just deleted. Similarly, the 
deleted 8 in the third column contributed to the next 
balance 2208-804. 

Again for comparison almost the same equation is 
dealt with on p. 87, but contraction takes place after three 
stages. 

38. Rational Roots of an Equation. If the co- 
efficients of an algebraic equation are complex we may 
replace the equation by one of higher degree whose 
coefficients are real and whose roots include all the original 
roots (5, Ex. 2, and 27, p. 64). If these real coefficients 
are irrational we may approximate to them by decimals to 
any degree of accuracy, in which case the corresponding 
roots are approximations also of any degree of accuracy 
(p. 69). Multiplication by a suitable power of 10 then 
removes the decimals and yields integer coefficients. 
Hence the limitation of the discussion to rational equations, 
with integer coefficients, is less of a restriction than at 
first sight appears. 

For example, V 3-|-i)a3+4 = 0 
is approximately equivalent, with fair accuracy, to 
l-414a:2-2-232a;-f 4 = 0, 
and with still more accuracy, to 

l-4142l£c2-2-23205.'cH-4 = 0, 

that is, to 

141421a;2-223205a; -1-400000 = 0. 
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Equations with Integer CoefEicieuts. We may 
always reduce a given equation to the form 

= 0 . . ( 1 ) 

A case of special interest arises when the coefficients 

a„ are aU integers, for the roots of the equation 
are then (6) p. 11, a-igebraic integers. If aU the coefficients 
Uq, %, ..., of ail algebraic equation are integers the 
reduction to the still more special form (1), in which 
Uq = 1, can always be carried out ; for we have only to 
take z — a^x as a new variable to obtain an equation in 
2 , namely, 

which is evidently of the form (1). 

For example, if 7x^—6x—5 = 0, and z = lx, we have 
49aJ® — 42a; — 35 = 0, that is, 2 ® — 6s— 35 = 0. 

Theorem. Each rational root of an equation with 
integer coefficients which is in the form (1) above is necessarily 
an integer. 

Proof. Let pjq, rational and in its lowest terms, be a 
root of equation (1). Then 

(^>/2r+«i(l>/2')"~^+.-+o« = 0. . . (2) 

Multiply through by q'^-^. Then every term is a whole 
number except the first, which is p”'jq, a fraction. This is 
impossible : hence pjq must be a whole number. 

To Find the Rational Roots of a Rational Equation. 

If Uq 9^:1 we first bring the equation to the standard form (1). 
By the factorized form of a pol 3 momial (25) p. 60, is now 
the product of the roots of the equation, and these if 
rational are integers, as we have just seen. Thus the only 
possible rational roots are to be found among the factors 
of with positive or negative sign affixed, and these 
must be examined one by one. 
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Example. 9a;* + 623 ® + 1 9a;2 -f- 6a: — 1 6 = 0. 

Instead of multiplying by 9^ it is enough here to multiply 
throughout by 9 and to put 3a: = 2 . This gives 

/(z) = z*+2z3 + 19z2 4-18z — 144 = 0. 

The factors of 144 are ±1, ±2, ±3, ±4, ±6, ±8, ±9, ±12, 
etc. Actual roots are 2, —3, so that (z-2)(z+3) is a factor 
The other lactor is z®±z±24, which leads to complex 

roots. Hence x — —1, and two complex values. 

O 

39. Other Methods of Solving Equations. 
Iteration. If the equation /(a;) = 0 can be readily 
thrown into the form x — <f){x) it is sometimes possible to 
solve it by iteration. Geometrically the problem consists 
in finding the x coordinate of a point Z common to the 
Ihie OP and the curve LM, whose equations are y x, 
and y = ^{x) respectively. 



Take any point A on OP and draw a staircase or spiral 
polygon, with lines alternately parallel to Oy and Ox 
and corners alternately on the curve and on the line OP 
In both figures the points A, B, C, tend to the desired 
limit Z, the point where OP cuts the curve LM. 
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Now the coordinates of B, C, ... are easily found from 
those of A. If A is {a, a), B is (6, h), etc., then evidently 
the ordinate y of Bis that of H, the point (a, ^(a)). Hence 
b — c = ^{b), d = ... . 


O ^ 

Whenever the sequence a, b, c, d, ... tends to a limit a 
it fimnishes a root a of the equation f{x) = 0. 

Example. x^—2x — 5 = 0. 

q 

Take this in the form x = ^{2x+5) and let a = 2. 

Then 6 = ^9 = 2-08.. and so 2h+5 = 9-16.. 

Thus c == ^9-16 = 2-092.. and 2c+5 == 9-184.. 

d = ^9-184 = 2-0942.., and in the same way 
e == 2-0945..,/ = 2-09455.. 

Actually a = 2-09455148... 

The process is available whenever the repeated 
calculation of can readily be effected. While the 

choice of the initial value a is largely arbitrary, it may 
quite possibly lead to a divergent sequence. For example, 
had A been taken to the left of the point K in Fig. 1 
it would have led to a staircase diverging from K, as 
experiment will at once show. Unless the curve were to 
recross the line PK produced no limit would then be found. 

Also in Fig. 2 if the curve at Z crosses OP at 45° down- 
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wards, that is, if g5''(a) = —1 at (a, <f){a)) then difficulties 
may arise. Indeed, if the arc LM is symmetrical with 
regard to OP, cutting OP at right angles, the spiral polygon 
becomes a square on diagonal AB, endlessly repeated : 
and G coincides with A. 

The method of iteration has the agreeable property 
that it is not seriously vitiated by an intermediate mistake 
or two committed in the calculation at any stage, provided 
that the sequence of results points to a clear limiting 
value ! Had c been taken as 2-082 in the example this 
would merely have been equivalent to choosing a value 
of a rather less than 2. The values of d, e, / would stiU 
have 2-09 as their leading digits. 

This pleasing feature makes the method popular. For 
a more detailed account of it the reader should consult 
Whittaker and Robinson, The Calculus of Observations, 
pp. 78-84. The method is due to James Gregory and 
Michael Dary independently in 1674. The same principle 
was also used by Newfon (1707). 

40. The Approximate Method of Newton. If the 
curve y = f{x) is knovm to cut Ox near a point x = a, 
we may proceed as follows : Choose a point A near to Z 
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on the axis of x : draw the ordinate AP to cut the curve 
at P, and the tangent F£ to cut Ox at £. Draw the 
ordinate £Q to cut the curve at Q and the tangent QG 
to cut OX at G. Repeat the process. J.iy = f{x) denotes 
the curve and if both f'{x) and /"(x) do not vanish in the 
range AB in the first figure dr AZ in the second, then this 
procedure, repeated again and again, yields points A, 
B, G, ... which approximate to the point Z, as the actual 
construction shows. 



In tliis -way we find a series of successive approximations 
to the root a of the equation f{x) = 0, which lies near a 
given value x = a. 

Analytically, let y—fia) =f'{a){x—a) be the equation 
of the tangent at P, the point {a, f{a)). Then OB or b 
is given by putting y = 0, so that 

h = a—f{a)/f'{a). 

Similarly, c = b—f'{b)/f(b), and so on. 

Example. x^—2x—5 — 0. 

Here/(2) = — 3,/(3) = 16, so that a root lies between 2 and 3. 
Put a = 2. Then 6 = a— (a®— 2a— 5)/(3a*— 2) = 2*1. 
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Also 


c == 2‘1 


2-13--4-2-5 

3x2-12-2 


= 2-1 


2 . 0946 . 

13-23-2 


J = 2.0946 -. 

3x2-09402-2 


= 2-0946 -0-00054155/11-162 
= 2-094551483, 


•where the last figure 3 is approximate. 


As the above example may suggest, Newton’s method 
in favourable cases is powerful and rapidly convergent. 
It is applicable not merely to algebraic equations, but to 
transcendental equations, provided that f'{x) is readily 
calculable : for example, the equation x = tan x can be 
solved with sufficient accuracy, provided that we have 
tables from which sec^aj can be interpolated. The method 
is also iterative, and so enjoys the advantage already 
mentioned, namely, that it is not vitiated by an error at 
an intermediate stage. Newton elaborated a geometrical 
form of this method in the Scholium of Proposition 31 
Book I of the Principia (1687), where he applied it first 
to the equation x—e sin x = N, and next to e sinh x—x = N. 
These equations arose out of Kepler’s Problem, to find the 
position of a planet at a given time in an elliptic or hyper- 
bolic orbit of eccentricity e. 


Examples. 

1. Show that e® — 1 = 2x has one positive real root, and 
approximate to it by Newton’s method. [Take a = 1.] 

2. Determine the real roots of a:® — 10x’2d-4 = 0 to two 
significant figures. 



OHAPTEE VII 


THE LOCATION OF THE ROOTS OF AN EQUATION 

41. The Significance of the Sign of a Polynomial. 
Let us consider the graph of the real polynomial 

It is a continuous line which crosses each line parallel to 
the axis a; = 0 once, and it crosses the axis «/ = 0 , or 
touches it, at those values of x which give the roots of the 
equation /(x) = 0. If /(u)<0 and f{b)>0 we say that y, 
or f{x), changes sign as x passes from a to b. Now y is 
negative at x = a, changes continuously in value as 
X changes, and becomes positive when x — b. Unless y 
vanishes for some value of x between a and b this could 
not happen. Hence there is a root between x = a and 
x=b. Similar reasoning applies if/(a)>0 and/( 6 )< 0 . 

Now consider the graph itself with its possible ups and 
downs. Let A be the point on the graph where x — a, 
and B be the point where x = b. 



If /(a)< 0 , f{b)>0 there may be one real root between 
A and B (as when B is at Bj in the figure) : or three real 
roots (as when B is at B 3 in the figure), but certainly not 
two real roots, or four real roots. 
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If the graph touches the axis y = 0 "we can regard 
the point of contact as the point of coincidence of two 
roots of the equation. 



If the graph touches the axis 2 / = 0 at an inflexion, 
we can regard the point of inflexion as the point of 



coincidence of three roots of the equation. And so on. 
The characteristic property is therefore as follows : 

Theorem. An odd number or an even number of real 
roots of an equation f(x) == 0 lie between two values x = a 
and X = b, according as f(a), f(b) differ in sign or have 
the same sign. 

Corollary. An equation of odd degree must have at least 
one real root ; and equation of even degree need not necessarily 
have a real root. 

For if X is large f{x) behaves like (cf. p. 21), 
where without loss of generality we may take a^ = 1. 
If x<cO so is x^ when n is odd. We write /( — oo)<;0. 
And if x>0 so is a;", and so /(oo)>0. Hence f(x) changes 
sign at least once, and there is a real root. But if n is 
even/(— 00 ) and/(+oo) are both positive and the function 
f(x) may indeed never become negative. 

Examples, a;® +4, cc* +2a;®4-4 are functions that cannot be 
negative for any real values of x. 
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42. Upper Bound to tlae Real Roots, and Newton’s 
Rule. Let us take the equation with «„ = 1, so that for 
large enough values of x the function f{x) is positive. If 
b is such that f{x)>0 whenever x>b we call b an topper 
bound (it used to be called an upper limit) to the real roots 
of the equation fix) = 0. Manifestly it is a help, in the 
search for the roots, to ascertain that they are confined 
below such a definite value b. Obviously no root can 
exceed b, a value of x from which onwards f{x) is always 
positive. There may also be numbers less than b which 
may yet exceed each root of fix) = 0 ; the bounds may 
be drawn more tightly. 

Newton gave a rule to the following effect : if f(b), 
f'{b), f"(b), ..., f'”^(b) are all positive, b is such an upper 
bound to the real roots of the equation f(x) = 0. 

The rule follows at once from Taylor’s formula 

fix) =fih+x-h) 

since the right-hand expression is positive when fib) and 
its n successive derivatives are positive, and x>b ; so that 
fix)>0 whenever x>b. 

Such a bound b is of course not unique, for any point 
to the right of b on the axis of x would answer equally 
well. 

Examples. 

1, fix) — X®— 9x2-1-4x4-90 = 0, 
f'ix) = 3x2 — 18x-f4, f"(^x) — 6x — 18. 

Here /"(x)>0 if x>3 and /'(x)<0 at x = 3, 4, 5, bixt 
/'(6) >0, /(6) >0 ; hence no real root exceeds 6, which is an 
upper bound. 

2. For X® — 12x2 4-57x— 94 = 0, 6 = 4. 

Lower Bound to the Real Roots. The equation 
= 0, which reverses the given equation, has for 

(Jr 
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its roots the reciprocals a~^, of the roots a, 

j8, A of /(a:) = 0. For 

= ao(l— aa;)(l— ^a:)...(l— Aa:), . . (1) 

and the roots are ar^, ..., A“^. Now if no root lies 
between 6 and +oo, when b is positive, then no reciprocal 
lies between b~^ and zero. Accordingly, if 6 is a positive 
upper bound of the reciprocal equation, 6~^ is a lower 
bound for the positive roots of the original equation. 

Example 3. The reciprocal equation to f{x) = 0 in 
Ex. 1 is 90a;®+4a;* — 9x + l = 0, 

and the derivatives are 

2T0x^-\-8x—9 and 540a; + 8. 

Now a: = 1/6 renders these three polynomials positive, and 
so no root lies between 1/5 and +co, so that no root of the 
original equation lies between 5 and 0. We infer from 
Examples 1 and 3 that real positive roots of 

9a;2+4a;+90 = 0 

can only lie between x = 5 and x — 6. 

A lower bound to all the real roots can be found by 
considering the upper bound of f{—x) = 0, the roots of 
which are —a, — jS, ..., —A ; and, finally, an upper bound 
to the negative roots is given by considering «"/(— a:-i) = 0, 
the roots of which are — a“^ — A“^. 

Example 4. If ^{x) = a;® + 9a;® + 4a;— 90 == 0, 
then <^(— a:) = 0 is the equation of Ex. 1. The derivatives 
are positive for x>0, but (^(0)<0, ^(2)<0, ^(3)>0. Hence 
a; = 3 is an upper bound ; and no root of the original equation 
of Ex. 1 is less than —3. 

Again, when f{a)<0, /(6)>0, and J'(x)>0 for a<x<b, 
then f{x) = 0 has exactly one root between a and b ; for 
the graph must be climbing all the way between x — a and 
X =b. In Ex. 4 there is exactly one root between 0 and 3. 
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43. Relative Location of Roots of an Equation and,' 
of the Derived Equation. The follo'wing important and 
useful theorem, first published by RoUe in 1689jd§^true 
for functions more general than polynomials, but is" ap;gted;“ 
here to algebraic equations. • , 

Rollers Theorem. Between any two consecutive roots of 
the algebraic equation f(x) = 0 there lie an odd number of 
roots of the equation f'(x) = 0. 

Since the graph y = f{x) is continuous, f{x) must have 
a maximum or minimum at least once between any two 

A 


:s 


values X == a and x — ^ for which f{x) vanishes. In fact 
f{x) caimot always increase as x changes from a to j8, 
if f(a) = f{B) : nor can it always decrease. Again, at such 
a maximum or minimum the gradient of the tangent is 
continuous, so that/'(a;) exists but is neither >0 nor <0. 
It can only be zero. 

Once more, if an even number of zeros of/'(a;) lie between 
two points P and Q on the curve, then the gradients of 
the tangent at P and Q are both positive or both negative. 
Therefore if the curve is ascending through P it is ascending 
through Q : and this is impossible when P and Q lie at 
consecutive roots of the equation f{x) = 0. Similarly for 
descending values. Hence f'{x) vanishes an odd number 
of times between two consecutive roots of f{x) = 0. 

Corollary. The roots of f"(x) = 0 separate those of 
f '(x) = 0 in a similar manner. 

For an illustration see p. 19, Ex. 1. 

44. The Harriot-Descartes Rule of Signs. We now 
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proceed to investigate a remarkable theorem, implicit in 
the work of Harriot but first used explicitly by Descartes 
(1637), which limits the number of positive or negative 
roots of an equation ; but first we require some preliminary 
definitions. 

When a polynomial is 

arranged in descending order of index and has non-zero 
real coefficients, a continvMion is said to occur whenever 
the signs of two consecutive terms are the same, and a 
change is said to occur whenever these two signs are 
contrary. Thus in the polynomial 

a:®— 3a:’+4a;® — oc^ — 2x^~3x^-\-2x-i-B 

three continuations occur, following the terms — a^, 
—2a:®, -\-2x respectively, and four changes of sign occur, 
following the terms x®, — 3x'^, + 4x®, — 3x® respectively. 

When n, m, p, ..., q are consecutive positive integers 
the polynomial /(x) is said to be complete. All these remarks 
apply also to the associated equation /(x) = 0. The theorem 
in question exhibits a connexion between these changes 
of sign and the number of positive roots of the equation. 
We begin with the following lemma : 

Lemma. If a real polynomial, complete or incomplete, 
is multiplied by x— a, where a>0, the product will contain 
at least one more change of sign than the original. 

(i) Proof for the complete polynomial. Let the 
multiplication be performed in the ordinary way, but let 
the signs only of the terms be written down ; then we shall 
have a scheme such as 

+ + + — — + — — — -|- 

+ - 

+ + + — — + — — + + — + 

— — — — + -|- — — + — 

+ ±±-^+~T+±- + - 
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The top row represents the complete polynomial written 
in descending order, wherein any arbitrary sequence of 
signs may occur. The second row represents x—a, where 

the signs are fixed as -j . A double sign is placed 

where the sign of any term in the product is ambiguous. 
The following laws will be seen by inspection to hold : — 

■(1) To every group of consecutive continuations in the 
original there corresponds a group of the same number of 
ambiguities in the product pol 3 momial. 

(2) In the product polynomial the signs before and 
after an ambiguity or group of consecutive ambiguities 
are contrary. 

(3) In the product polynomial an extra term appears 
at the end, and with it a change of sign is introduced. 

Now in the product pol 3 momial take the most un- 
favourable case and suppose that all the ambiguities are 
replaced by continuations ; then, by the second law, 
without affecting the number of continuations, the upper 
signs may be adopted for the ambiguities ; and thus the 
signs of the original polynomial will be repeated in the 
product polynomial, except that by the third law there 
is an additional change of sign introduced at the end of 
the new polynomial. Thus in the most unfavourable case 
one more change of sign occurs in the product polynomial. 

(ii) Proof for the incomplete polynomial. Introduce 
zeros, together with the signs -|- and — , in the 
multiplication scheme, in order to indicate exactly the 
missing terms ; for example, 

-f- -|- 0 — 0 0 “1“ — 0 — -j- — 0 ~f- 

0 H 00 hO hO- 

Between each zero, or group of zeros, is a complete subset 
of terms to which the preceding case applies, so that 
each subset gains at least one change of sign. But there 
may be a loss of a change at the actual gap, as in the 
first zero gap of the illustration above, where + 0 — 
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becomes . Now the number of complete subsets 

separated by zero gaps is one more than the number of 
these gaps : hence the certain gains outweigh the possible 
losses by at least one. The result is thus established. 

Descartes' Theorem. In any equation, complete or 
incomplete, the number of positive roots cannot exceed the 
number of changes of signs of the coefficients, and in any 
complete equation the number of negative roots cannot exceed 
the number of continuations in the signs of the coefficients. 

Proof. Resolve f{x) into f>{x){x — a)[x—b) ... {x—c), 
where ijj{x) contains all the factors due to negative and 
to pairs of complex roots, while all the factors x—a, and 
so on, due to positive roots are explicitly given. Now 
multiply ifj{x) by x — a, x — b, ... in turn. At each 
multiplication at least one change of sign, by the lemma, 
is introduced into the product, so that the first part of the 
theorem follows immediately. 

To prove the second part we suppose /(x) to be complete 
and put —y for x ; then the original continuations of sign 
become changes of sign. Also the transformed equation 
cannot have more positive roots than it has changes ; 
and thus there cannot be more negative roots in the 
original complete equation than the number of its 
continuations of sign. 

Corollary. An equation, complete or incomplete, cannot 
have more negative roots than f(— x) has changes of sign. 

Tor the negative roots of f{x) = 0 are the same as the 
positive roots oif[—x) ~ 0. 

Examples. 

1. x^-\-2x—2 = 0 has one change of sign and may there- 
fore have one positive root. Actually it has a root between 
0 and 1 since /(l)>0>/(0). Now 

(— a;)3-f2(-a;)-2 = 0 

has no change of sign, and therefore has no positive root ; 
and so the original equation has no negative root. It has 
two complex roots. 
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2. X® — 1 = has one real root, namely x — 1, 

3. — 1 = has two real roots, namely x = 1, a; = —1. 

4. ~2x^ —5 = 0 has one real root, positive. 

5. x*’-{-x^-\-l — 0 has no real root. 

■45. A Precise Rational Test for Real Roots. 

Remarkable as the Harriot-Descartes Rule of Signs is, 
it still leaves an uncertainty as to the exact number of 
real roots in an equation : it only gives an upper limit 
to them. The problem of finding an exact test engaged 
the attention of mathematicians for the next two hundred 
years, until it was finally solved in 1829 by Sturm, who 
published his result in the Memoires Divers des Savants 
Strangers, Paris, 1835. Sturm showed how to find for 
any equation, by rational methods, the exact number of 
real roots which lie within any given range of values. 

Sturm's Theorem. There exists a set of real polynomials 
f(x), f'(x), f 2 (x), ..., fm(x) whose degrees are in descending 
order, such that, if b>a, the number of distinct real roots 
of f(x) = 0 between x = a and x = b is equal to the excess 

of the number of changes of sign in the sequence f. f'. fa 

fjji when X = a over the number of changes of sign when 
X = b. 

Proof. First suppose that f{x) has no repeated factors. 
Take f{x) — a:” -i-OiX '^-^ +...+«„, 

f'{x) its derivative, and f^ix) the remainder, with its sign 
changed, of the division f{x)/f'{x). Continue the G.C.M. 
process until a constant is necessarily reached, since / 
and /' have no common factor (p. 61) ; but at each of 
the m— 1 remainders change its sign before continuing. 
These modified remainders are the required functions ; 
and we have the following relations : 

/ = Qif -u 
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where each f or q indicates a polynomial in x or, in the 
final case, a constant 

From these relations we can draw certain inferences : 

(1) else all the preceding f’s have a common 
factor involving x. Hence the constant has a permanent 
sign -j- or — , whatever value attaches to x. 

(2) No two consecutive /’s can vanish simultaneously ; 
else they and all their predecessors would have a common 
factor. 

(3) If any /» = 0 then, at this value of x, and 
/i_i have contrary signs, since they must be equal and 
opposite in value. 

Now consider the sequence of signs in the set of /’s 
when X = a, which is taken at a place where none of the 
f’s vanish. As x increases from x = a no change of sign 
within this sequence can occur until x reaches a zero of 
one of the f’s. We therefore examine what happens at 
such a zero. 

If f^ = 0 Sbt X = c, take a range {c—h, c+h} covering 
the value c, and such that no other / vanishes within the 
range. This is possible since the zeros of all the f’s are 
definitely distinct. The signs of /j-_j at a; = c — h, c, c-\~h 
are the same, let us say + + + : those of /j.+i are therefore 

, by (3) above : while those of /,• are i, 0, dr- 

Accordingly the signs of /,-, /;+! are + i — just before, 
-f 0 — at, and + i — just after x = c. In each case this 
group therefore presents one permanence and one change 
of sign as x passes through a zero of ; and the same 
happens if fi_^ is negative a,t x = c. 

This argument is equally vahd for each /»• including /', 
but not for the initial member / of the sequence. As x 
increases and passes through an unrepeated root a of the 
equation / = 0, the signs of / must be either + 0 — or 

— 0 +j while the corresponding signs of /' must be 

or + H — hj according as / is decreasing or increasing 
through the root. In either case the signs of /, /' differ 
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before, and are the same after, the passage of x through a ; 
hence one change of sign is lost. 

Thus we have proved that as x increases, Sturm’s 
functions never lose a change of sign except when x passes 
through a root of the equation / = 0, and never gain a 
change of sign. Hence the number of changes of sign 
lost as X increases from a value x = a to a. greater value 
a? = 6 is equal to the number of the roots of the equation 
f(^x) — 0 which lie between a and b. The theorem is thus 
established for the case of unrepeated factors. 

Consider next the case where f{x) has repeated factors. 
Then /, f' possess a G.C.M. {x—a)^{x—^)‘^...=0, 
which is common to all the Sturm functions / /j, ..., 

Let us write / = 0<l>, f = Gcf>^, == Gcf>^, /„ = Gcj)^, 
where necessarily 

0 = {x—a){x—fi) ... [x—X) 

and no two functions have a common 

factor. By the previous case the number of real zeros 
of <f> can be ascertained from the Sturm sequence of </i’s. 
At any value x — c for which G^O, the signs of the f’a 
will be either identical with those of the <^’s (for G>0} 
or else will be entirely contrary (for G<0). In either case 
the / sequence has the same number of changes of sign 
as the ^ sequence. At a value x = c for which G — 0, 
no signs appear at all, for all the / terms are zero. Hence 
the examination of the sequence of signs of the / series 
between x = a and x = b discovers the number of real 
roots of the equation 0 = 0, that is, the number of real 
distinct roots of / = 0, since every zero of / is included 
in 0. This completes the proof of the theorem. 

Corollary I. // fj. is a remainder involvmg x and such 
that fr remains positive, or remains negative, for all real 
values of x between a and b, then the sequence need not be 
prolonged beyond 

For in the preceding demonstration the necessary 
property of the last / was that it should never vanish, 
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and as cannot vanish. th.e argument holds for the series 

Corollary 2. Any f can be multiplied by a positive 
constant, or a factor involving x, provided that the factor 
remains positive throughout the range in question, a7id the 
modified function can be used for computing all further 
terms of the sequence. 

For this modification leaves the essentials of the above 
proof unaltered. 

Examples. 1. If /(a:) = cc®— 4a7 + 13 = ^ 0, 
f'(x) = 3x^-— 6a;— 4, 

/2(a;) = 2a;— 6, 

The computation can be arranged as follows ; 


1 _3 _4 +13 

3 -6 -4 

3 _9 -12 +39 

6 -12 -8 

3 -6 -4 

6 -16 

-3 -8 +39 

+ 3 -8 

—3 +6 +4 

+ 3 -11 

-7 )-14 +36 


2 -5 



This is the ordinary G.C.M. process ; only after each remainder 
/2 and /a the signs have been changed. To avoid fractions, 
/ has been multiplied by 3 before the division. Division by 
— 7 changes the sign of the first remainder and gives a suitable 
/g, by Corollary 2. Next a scheme of signs can be devised : 



/(a?) 


fz{x) 

U^) 

Changes 

CO 

— 

+ 

— 

+ 

3 

10 

— 

+ 


+ 

3 

0 

+ 

— 


+ 

2 

1 

+ 

— 

— 

+ 

2 

2 

+ 


— 

+ 

2 

3 

+ 

+ 

+ 

+ 

0 
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In the first column we can wi’ite arbitrary values of t ; 
in the last column we write down the number of changes of 
sign in the sequence. Here there is one negative root between 
-10 and zero, and two positive ones, between a; = 2 and 
a; = 3. Between which integers is the negative root ? 

2. If /(a:) = 2a;«-13a;H10a;-49, 

f[%] = 2(4ic®-13ir-f5), 
jlx) = 13a;2-15a;+98. 

Here has no real factors and is always positive for real 
values of .r. The signs at a: = - oo are + - + and at a: = + oo 
are + + + ; so that there are two real roots, as shown by 
two losses of change in sign. At a; = 0, the signs are 
implying one negative and one positive root. 

3. Locate the roots of a;Ha;®-4a:+l = 0. 

4. Locate the roots of a;*-5a;H8« = 10 and evaluate 
a root, l<a<2, to three decimal places. 

5. Find the real root of 48a;® = 3a;®-f 3a:-f 1. 

6. Also of a:®+5a:Hit=27. 

7. The equation a;®-3a;+ 1 = 0 has three real roots, a, jS, y. 
Prove also that the roots of a)®-3a;H3 = 0 are p+y+l/a 

and two similar expressions. 
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BINOMIAL AND RECIPROCAL EQUATIONS 

46. The Binomial Equation. If each coefficient of 

f[x) = = 0 . . (1) 

vanishes except two, the equation is binomial in form, 
and reduces, apart from zero roots, to 

= 0 , ( 2 ) 

which is called the binomial equation. By means of 
Demoivre’s theorem this can always be solved. 

There are two cases, the arithmetical and the complex. 

The Arithmetical Case. In this case we assume that 
a>0. The equation x'^—a = 0, accordingly, has not 
more than one positive root, by Descartes’ Rule of Signs : 
and since /(0)<0, /(oo)>0 it has exactly one root, which 
we denote by !J/a, and call the arithmetical nth root of a. 
Here a may be a perfect nth power of an integer or rational 
fraction, in which case x is rational, or a may not be a 
perfect nth power of a rational number, in which case x 
is irrational. 

Example. If a3®~32 = 0, x = 2. If x®— 36 = 0, x is 
^36, an irrational number. 

The root may be computed as accurately as may be 
desired in various ways— from a graph for preliminary 
location, by logarithms, by the binomial theorem, or by 
Horner’s method. 

The binomial theorem is most effective when a is nearly 
a perfect nth power. Here, for instance, 

108 
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= (32+4)1/5 ^ 2(1 + 1 / 8 ) 1/5 
= 2 ( 1 + 1 / 40 - 1 / 800 +...) 

= 2 - 0475 ... 

from three terms of the series. The use of a seven-figure 
table of logarithms hardly improves on this. 

Example. Find the real root of as®— 60 = 0 to seven 
places of decimals by Horner’s method, comparing the result 
with that given by seven-figure logarithms. 

The Complex Case. Let a be written in the complex 
polar form 

a = r(cos 0+i sin 6), 

where r>0 ; and let 6" = r, where b = ^r, the arithmetical 
nth. root of the modulus r of a. If we now put x = by the 
equation oj” = a reduces to 

= cos d-\-i sin 6, 

which is solved by taking 

0+2577 . . 0+2S7T ^ „ 

y = cos \-i sm , 5 = 0, 1, 2, ..., n—\. 

n n 

Proof. If y has this value, then by Demoivre’s theorem 

yti _ eos(0+2577)+i sm(0+2s7r) 

= cos 0+i sin d 

for each value of 5. Hence the stated value of jy is a root 
of the equation. 

Also for the n given values of s, y takes exactly n 
different values, which are represented by points on a 
circle of unit radius whose amplitudes differ successively 
by 277 / 71 ; and such points lie at the n vertices of a regular 
polygon inscribed in the circle. Hence each possible root 
of this equation of the nth. degree has been identified, 
for there cannot be more than n roots. 
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Furthermore, if s takes higher values n, w + 1, or 
else negative values, the same vertices are repeated. For 



example, the point ISTg in the figure is given hj n ~ 6, 
s = — 4, 2, 8, 14, .... 



Geometrically the solutions of the equation 
a;" ~ eid =z cos d-\-i sin 6 
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where 6 is, of course, real, are given by the following 
construction ; Draw a circle with centre 0 and of unit 
radius OA. Take P a point on it such that the angle 
AOP is 6. Mark the point on the circle such that the 



angle AON^ is djn : then mark the n vertices of a regular 
inscribed polygon The complex numbers 

answering to these n vertices are the n distinct nth roots 
of The figures exhibit the cases n = 5 and 6. 

General Solution of x”' — a. By combining the 
above results we obtain the n values 


I j-2s7T . . 0A-2s7r\ 

X = ( cos sm 5 = 0, 1, 2, n— 1, 

n n ) 

where \a\ denotes the modulus of a, and 0 is its amplitude. 


Examples. 

1 . 0^3 + 8 = 0 . 


Here 
and so 


— 8 == 8(cos TT+i sin tt), 


X 




, 2 (cos 77 sin tt), 


„/ RtT . . 57t\ 

2^cc,-+.sm-j 


= 1+iVs, —2, l—iVs. 
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The regular polygon in this case is an equilateral triangle 
with one vertex at the point (—2, 0). 


2^77 2^77 

2. — 1. Here a ~ I, cb = 0, x = cos [-i sin * 

7 7 


3. = i. Here a 


cos sin 
2 2 


(45-rl)^ . . (45-j-l)'3T 

* -= 008 — jj— +. Sin ■ 


4, The real roots of — 1 =0 are ±1, if n is even, and 
+ 1 only, if n is odd. On the other hand, a;^^-|-l == 0 has a 
single real root — 1 if n is odd, and no real root if n is even. 

5. Group the complex roots of a?” — 1 =0 into conjugate 
pairs. Hence resolve x'^ — 1 into real factors. 

_ _ 2577 . . 2577 _ . _ 

(Take x = cos — sis such a pair. Then the 

n n 

quadratic polynomial 



n 


vanishes for these two values of x ; that is, x^—lx cos + 1 

n 


is a factor of a?” — 1 for integral values of 5 . 

If n is odd there are 1) such quadratic factors, where 

5 = 1, 2, 3, 'Kn — 1). If n is even, there are |■(n — 2) such 
factors with 5=1, 2, 3, ..., -|■(n~2). This accounts for all 
the complex factors, but there are also real linear factors, 
namely (a; — l)(irH-l) when n is even, and x — 1 when n is odd.) 


6 . x ^-1 = 

7. x^-l = 




— 2x cos ^ + l)|a;2— 2a: cos 


x^—2x cos 




x^~2x cos 



47. Euclidean Construction of the Regxxlar Polygon. 

The solution of the binomial equation a;”— 1 = 0 and the 
construction of a regular polygon of n vertices are evidently 
closely connected, and already we have considered a general 
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trigonometrical form of tte solution. But there are 
other more purely geometrical or algebraical methods at 
our disposal, and it is natural to find out how far, for 
instance, the time-honoured ruler and compass construc- 
tions of Euohd would carry us in forming a regular polygon 
of n sides. Euchd gave us the triangle, square and 
pentagon (n = 3, 4, 5) and any figure deducible from 
these by repeated bisection or by combination of cases. 
In fact whenever n = 2™.3.5 a regular figure could be 
drawn by ruler and compass. 

For example, to construct a regular 15 -gon, take the arc 
of the pentagon, which is a fifth of the circumference, from 
the arc of the triangle, which is a third, and bisect the resulting 
arc. 

But how shall we deal with such cases as w = 7, 9, 
or 11 ? Great interest has been taken in such problems 
throughout the ages, and notable results were found by 
Lagrange and Legendre, who examined the corresponding 
algebraic equations ; but it was Gauss who made the 
greatest discovery in this field since the time of Euclid, 
for in 1796 Gauss proved that the 17-sided regular polygon 
could be drawn with ruler and compass. In fact he proved 
that the case when 

n = 22»»-|-l 


is always possible provided that n is prime. The lowest 
values of n satisfying this condition are w = 3, 5, 17, 
267, 66537. 

Algebraically, the use of linear and quadratic equations 
is equivalent to such ruler and compass constructions ; 
and it ca-n be shown that the solution of the equation 
a;” — 1 = 0 is reducible to that of quadratic equations by 
what are otherwise rational processes, in precisely those 
cases in which, geometrically, the ruler and compass 
succeed. We cannot attempt any proof here, but one 
illustration can be given. 

H 
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Example. Take n = 5, a:® — 1 — 0. Segregate the factor 
X — 1, leaving 

a;*+a;®+a:®+a; + l == 0. 

This is a simple instance of a reciprocal equation (48), To 
solve it we put y — x-\-l/x, so that = a;® +2 + 1/a;®. Hence 

x^+x+l-\-l/x + l/x^ == 2 /^+ 2 / — 1 = 0. 

We can solve this equation for y, and we can also solve the 
quadratic equation corresponding to the substitution, 

x^—xy-{-l = 0, 

as a quadratic for x in terms ofy; in fact we have 

X = i 2 /±'\/i 2 /^ — ], where y = — 

The quartic equation in x has thus been solved by the 
successive use of two quadratic equations. 

48. Reduction of Degree of Certain Eq;uations. 
Reciprocal Equations. In the first place there are 
certain rather special types of equation which are reducible 
to equations of lower degree and root extractions. It is 
easy, for example, to reduce equations such as 

= 0, or -\-c = 0 

by the substitution y — a:”. 

Examples. 

x«+9a;3+8 = 0, (cc®+3a;+2)2 — 4(a:® + 3.T+2) + .3 = 0. 

Such instances are easy to devise, but they seldom 
occur in practice. A more important and general case is 
that of the reciprocal equation ; 

where a, = for s = 0 , 1, 2, ..., n. 

Theorem. The reciprocal equation of even degree is 
rationally reducible to an equation of half that degree. 

Proof. Put z = x-\-llx, so that = x^+2+llx^, 
— x^-{-Zx~\-^jx-\-ljx^, and so on. Then 

x-\-\lx = z, x^-^-ljx^ = — 2, a:®+l/a;® = z^ — 3z, 
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and so on. By this iteration we express 

that is, as a pol 3 momial in 2 ; of degree s, for each value 
of s in succession. Now put n = 2m and divide the given 
equation throughout by x'^, so that it can be rearranged as 

Substituting for each bracketed expression its equivalent 
as a polynomial in z we obtain an equation in z of degree m. 

Corollary. A reciprocal equation of odd degree 2m +1 
can he rationally reduced to one of degree m. 

Proof. In such a reciprocal equation f{x) = 0 we find 
at once that /(—I) vanishes : hence x-\-l is a factor of 
f{x). If f{x) is written then ijj{x) is of degree 

2m and is also reciprocal, since the roots of ijj{x) = 0 
are those off{x) = 0, with the root —1 removed ; and the 
roots off{x) = 0 are reciprocal (42, p. 98) in pairs. 

The preceding theorem now applies, and reduces 
f{x) = 0 to an equation of degree m. 

Analytically, /(£c) = x‘^f(xr^) is the relation characteristic 
of a reciprocal equation f{x) = 0 of degree n. 

Certain equations are reciprocal in respect of the 
numerical values of the coefiS-cients, but alternate in the 
sign of the coefficients, in such wise that the terms may 
be grouped as 

x—ljx, x^-\-l/x^, x ^ — l/»®j ... . 

When this happens the transformation z = x—ljx will be 
effective. 

Other equations can sometimes be transformed to the 
reciprocal form. For example in 

l&x^+^<ix^+^bx^+'^ax-\-l = 0 

we should first put 2x — y, then y+l/y — z. The equation 
then reduces to 2 = 0. 
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Examples. 

1 . Solve = 0 . 

(Put x^ljx = z, Then 6 ( 2 ^- 2 ) +52-38 = 0, so that z = 2| 
or -3|. Hence x+ljx = 2-| or -3+ and x = 2, + -3, 

2 . Solve 2x^-\-lx^-{-k^-\-^x'^-\-lx-\-2 = 0 . 

(Divide the polynomial by ai+l, getting 

2a;H53J®+4a;^+5a:+2 = 0. 

The solutions are x = ~l, -2, — ±i) 

3. Solve 2a:H3a;®-4a:^-3a:+2 = 0. 

(Put x-ljx = z. Solutions are a; = 1, ~1, -2, |.) 

4. Solve 6a:H35a;3+62x’H35a;+6 = 0. 

5. Prove that tho points of the Gauss plane representing 
the five roots of ( 2 +I)® = 322® are concyclic. 

6 . Calculate the fourth roots of 8(-l+iy'3). 
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THE CUBIC EQUATION 

49. Historical and Introductory. Examples of cubic 
equations have recently been discovered by 0. Neugebauer 
among ancient Babylonian records. Cubic equations 
were considered by Diophantus about a.d. 300, but the 
first European mathematicians to give a complete solution 
of them belonged to the Italian school of Bologna at the 
time of the Renaissance : they were Scipio Ferro, Nicolas 
Fontana, surnamed Tartaglia (that is, the Stammerer), 
and Cardan (who visited St Andrews in Scotland in 1552). 
The solution which usually bears the name of Cardan, 
who brought it into prominence, is really due to Tartaglia. 
The general cubic equation is 

= 0. (1) 

By taking a: = y— we can reduce it at once to the 
form 

jy+ laf- iajaa-ha3 = 0. . (2) 

This equation can now be written as 

x^-\-ax-\-b — 0 (3) 

which we shall call the reduced form of the cubic. It lacks 
a term in x^ : otherwise the coefficients are general. 

If happens to be zero it is quicker to put x = Ijy, 
in order to reach a reduced form of cubic equation. 
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50. Cardan’s Solution of the Cubic. To solve 

x^-\-ax-^h = 0 . . • (1) 

put X = z~{-v. Then 

x^ = z^-{-v^-\-3zv(z-\-v)\ 

= z^-\-v^-{-Zzvx. ] * • ( ) 

Equation (2) may he regarded as a cubic equation in x 
and is in reduced form, since there is no term in x'^. 
Equations (1) and (2) are the same provided that 

= —b, Zzv = —a. . . (3) 

To solve this pair of simultaneous eqations for z and v, 
write zH^ — — a®/27, so that 2 ® and w® are the roots of the 
quadratic 

A2+6A-a®/27 = 0, . . . (4) 

where the sum of the roots is — 6, and the product of the 
roots, namely 2 ®v®, is — a®/27. Hence A = 2 ® or z;® ; 
that is, 

A = _J6±V(6"/4+a®/27) = 2 ®, w®. . (5) 

Since 2 and v have entered the work symmetrically, it 
does not matter which is which : and so we let 2 ® be given 
by the positive, and v® by the negative, sign. Finally 
we take the cube roots and add, so that 

cB = a+z.= ^{-^6+V(6='/4+a3/27)}+4/{-i&-A/(&V4+a®/27)} . (6) 
This is the celebrated formula of Cardan (1573). 

Example. a:® + 3a;+8 = 0. 

Here x = 2+w, 2 ®+-!;® = —8, zv = —1 and A® + 8A — 1 = 0, 
so that A = — 4±V20 = — 4±2-v/5. 

Thus » = ^(-4+2V6) + ^(-4-2V5). 

Since every number has three distinct cube roots 
(p. 109) we have evidently obtained several values of 
X by Cardan’s formula. This is as it should be, for a cubic 
equation usually has three distinct roots. But at first 
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sight it appears that there are nine, or even eighteen, 
possibilities in the formula, since alternative solutions 
exist for square and also for cube roots. As for square 
roots, a glance shows that the signs are fixed — one must 
be positive and one negative. Accordingly we consider 
the cube roots. 

Let the distinct roots of x^—1 = 0 be 1, cu, o)^, so that 

= 0, £o = — ~^—i 

Then ce® = 1, and the cubes of z, coz, are each equal to 
g®. Similarly for v. Hence there are three-times-three ways 
of taking the formula (6) for x, owing to these combinations 
of the pair of cube roots. But whereas each of the nine 
ways will satisfy the first condition 

2;3_|_y3 _ — 5 (g (a)2;)®+(ajz>)® = —b) 

only three ways will satisfy Szv = —a, namely z with v, 
ojz with oj^v, and co^z with cov. We infer that the three 
roots for x are given by the formulae 

X = z-f-v, coz-j-co^v, coh-j-ojv, . . (7) 

where z and v are the cube root expressions of (0). 

A check upon the accuracy of this result is afforded by 
adding together these three roots. The sum is 

(l+m+a)®)(z+y), 

w'hich vanishes sinqe l+cu+m® = 0. 

Example. — 1 8a; — 35 = '0. 

Here x = 2 + 3 , 2a)+,3oL>2, 2ct)2 + 3ft>, 



The sum of the roots is zero. 

51. The Gas© of Equal Roots. If two of the roots 
are equal we equate a pair of the results 50 (7). For 
example, let z+v = coz+co^v, so that (1— at)^ = (co^—l)v 
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and z = {—\—ai)v = cx)H. From the three possible pair- 
ings of roots we find that when 

z = V oi: ojv OT . . . (1) 

two roots are equal. Substituting in 50 (3) we have 
2v^ = — b, — a = Bv^ or Bcov^ or Bat^v^, 
so that 4u® = b^, 27 = — a® in each case. Hence 

A = 62/4+^3/27 = 0. . (2) 

The expression on the left side of this equation is called 
the discriminant. Conversely, if this discriminant vanishes 
then z = V ov a)V ov coh^, as we see from 50 (4), for both 
z and V are in this case cube roots of — J6 : and accordingly 
two roots are equal. 

Three roots of the equation a;3+aa;+6 = 0 are equal 
when and only when a == 6 = 0, as is seen by making all 
the values (1). equal, so that z = v — 0. In this case the 
unreduced equatiori is more interesting, and the condition 
for three equal roots is that 

x^-\-ajX^-{-a2X-\-aQ 

should be a perfect cube. If so, it must take the form 
(a;— a) 3, and therefore = —3a, = +3a2, = —a®. 
This requires two conditions to connect the three co- 
efficients : for example, 

a|+3a2 = 0, — 0. 

The expression A is not only the discriminant of the 
cubic but also of equation 50 (4), which is called the 
quadratic resolvent. We shall now see that it discriminates 
further and complex roots. 

52. The Ordinary Case A >0. If the quantity 
A = 62/4+03/27 

is positive the formula 50 (4) gives real values of both 
z and V, so that x = z+w is also real. In this case the 
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other two values of x are cuz-l-io^v and wh-j-iov. But 

<0 = — ^ 4 "^ iV^ ’ 

hence these values of x are 

~i(^+v)±iix/3(z—v), 

which are necessarily complex, since z—v cannot vanish 
unless J = 0, whereas J has just been assumed to be 
positive. We infer that whe7i A >0 two roots are necessarily 
complex, and one only is real. 

The Irreducible Case, A<0. The above certainly 
suggests that the outstanding possibility, namely the 
case when J<0, furnishes the criterion for the existence 
of three real roots x, which is indeed the case. In fact, 

if J<0, then the quadratic of 50 (4) for A has complex 

conjugate roots, giving, let us say, 

z® = p-\-iq, V® = p—iq, . . (3) 

where p = — and q= y'(— J), which are both real. 

It is possible to proceed by using Demoivre’s theorem, 
after putting 

P = r CO& 6, q = r sin 6, 

so that r and 6 are real. Then, by Cardan’s formula, 

» = ^{■p+iq)+ 

= r^(cos 6-i-i sin 0)4+f'^(cos 6—i sin 6 )^ 

1 / 6-\~2]c7r , , S-\-2k7T 0-\-2hTT , . 6-\-21cTr\ 

= r3 cos — ; \-i sin — |-cos — ^ sin — — 

\ 3 3 3 3 / 

= 2r^ cos , where h is any integer . . ^ . (4) 

3 sfsir,-- JMte 

Incidentally we have satisfied the condition Zzv — —a, 
where a is of course real, by taking the same value of h 
in extracting the cube roots of p-\-iq and p—iq. In other 
words, z and v are conjugate complex numbers. 
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' By taking )b = 0, 1, 2 we obtain three distinct values 

6 I 2/j7r 

of cos — r — and therefore of x, each of which satisfies 


the cubic equation, so that we have discovered the 
requisite roots. Taking further integer values of h merely 
repeats the same values of x, owing to the identity 


cos(^ +27r) 


cos Furthermore, these roots are all 


real. Hence if A <0 all three roots are real. 


Conversely, if all three roots are real, then z-f-v must be 
real, so that z and v must be conjugate complex numbers 
; and it is easy to verify that each of the three roots 
as given by 2+v, coz-j-co^v, co^ -j-ojv is real. Hence 

= liz^-v^)^ = (Sihk^+i^kY = -{Zh-k)^l^. 


This expression is either zero (giving the case already 
treated in 51 of two equal roots) or is negative. Hence 
if the roots are real and unequal then zl<0. 

Once more, in this case, p = — |6, q = ^/{—A), so that 

^2 _ p2_j_g2 __ 1&2_J _ 


while r cos B = p, that is, 

cos 6 = — 6/2y'(— a^/27). 


Hence 


:= 2 


-a 


27 


3 .- 
COS 


B+2h 


m 


k = 0,l, 2. 


(5) 

( 6 ) 


We call this the trigonometrical solution of the cubic 
equation. 

To sum up : if a and h are real the cubic equation 
= 0 has 

(i) three real roots when 2l<0, 

(ii) two equal roots if z3 = 0, 

(iii) one real and two comple.x roots if A >0. 



REAL ROOTS 


123 


Here A — 6^/4+a®/27 : it is the discriminant both of the 
cubic and of the resolvent quadratic A^+^A— a.®/27 = 0. 

In case (i) z and v are conjugate complex numbers, 
in case (ii) they are real and equal, in case (iii) they are 
real and unequal. 

Cardan’s formula at once yields the real roots in 
cases (ii) and (iii), but, paradoxically, cannot do so in 
case (i) without introducing imaginary numbers. To 
mathematicians of the sixteenth and seventeenth centuries 
this feature was very mysterious : they spoke of the 
irreducible case. The method of Demoivre clears the 
matter up and yields the roots ; but it still remains a 
curious fact that from a real cubic three real roots cannot 
be extracted by Cardan’s algebraic formula without a 
circuitous passage into, and out of, the domain of complex 
numbers. (See 53.) 

Examples. 

1. x^ + l2x-ll2 = 0. 

Cardan’s formula gives 

,^{56+V(3136 + 64)}+^{56-V(3136 + 64)} 

== 3^1 12-56854+ ^(—0-56854) 

= 4-828... -0-828... = 4. 

The answer is exactly 4, but the working involves the use 
of tables. For the other roots, divide by x—i. Then 

a3*+4a:+28 = 0, and x = — 2±i2-v/6- 

2. Solve a;®+3a3® + 15®+25 = 0. 

Put cc + l — y, then 2/®+12j/+12 =0. 

Here a = 12, 6 = 12, Zl = 100 and 

^(-6 + 10)+ ®/(-6-10) = 1-5874-2-5198 -0-9324, 

so that X = y — l = — 1-9324. This is the only real root. 
Complex roots for y are given by 

co^4-to22 J'2and .^4-0.2 .^2. 
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To evaluate them, call them a-fijS, a~i^ ; then 

— 2a = -0-9324, 

since the sum of the three roots must be zero. Also the 
product of the roots is —12. Thus 2a(a® + ;S®) = 12, or 
= 6/a — a®. Finally a = 0-4662, /3 = 3-5571 and 

X = y — 1 = a — l±i^ = — 0-5338±3-567H. 

53. Alternative Trigonometrical Treatment, If 
real roots only are required we may proceed as follows. 
Consider the identity 

4 cos^^— 3 cos (f> = cos 3(^ . . (1) 

and suppose x^-\-ax-\-b = 0. If these two equations are 
the same, then, on comparing coefficients, we have 

a;®/(4 — —axj{3 cos (f>) — —bjcos 3^ 

so that —3*^ = 4a gos^c[>, or x = 2\/(—^a) cos <l> 
and b — 2v'(— a®/27) cos 3^. 

Thus cos Z4> is known in terms of a and b, whence 
z is known. It leads to the result 52 (6) on writing 
cos \{d-^2kTT) for cos and thus it solves the case of 
three real roots. 

Now eos 39?> cannot exceed unity : hence 
6<-2v'(-a727), 
so that A — 67‘^+®727<0. 

(i) If, here, A = 0, then cos 3^ = ±1, or </> = 2/-7 t/ 3 
and cos ^ = 1, — J, — ^ according as A; = 0, 1, 2. This 
gives 

X — 2a, — a, — a 

as roots, where a = •\/( — a/3). It is the case of repeated 
roots, as we have seen already in 51. 

(ii) If A<0, then 

z — 2\/(— a/3) cos 


~Z 


= 1 , 2 . 
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When, however, ^>0, cos 3^ is greater than unity. We 
obtain a real result by assuming 

X = 2'\/(— a/3) cosh^, b = 2-\/{ — a^/27) cosh 3^, ^>0, 

since 4 cosh®0— 3 cosh^ = cosh 3^. Since cosh 3^ is non- 
periodic as a real function, it is impossible to obtain 
further real roots. Indeed, as we already know from 52, 
the other two roots are. complex. 

Examples. 

1. x^-Zx + 1 = 0. 

Here a = —3, 6 = 1, A — — f<0. 

Take = 1, cos 6 = — 6/2r = — so that 

e = 120°. Then 

/120+360ifc\° 

X = 2 cos ( g j = 2 cos 40 , 2 cos 160°, 2 cos 280°. 

2. Solve a;3-27£c-l-27 = 0. 

3. Solve as® — 6a; — 9 = 0. 

54. The Equation of Squared Differences of the 
Roots. Let a, j3, y be the roots of the cubic equation 
z^-{-ax-\-b — 0, and let us seek an equation whose roots 
are (/?— y)®, (y— a)^, {a—jS)^. 

We take y = — y)^ = a^+^^+y^ — 2a^y/a — a®. 

But, by 28 (3), a-j-^S+y = 0, jSy+ya+ajS = a, a/3y= —b, 
so that a^+jS^+y^ = (a+^+y)^— 2(^y+ya4-ayS) = —2a. 

Therefore y = — 2a+26/a — a® — (26 — 2aa — a®)/a. 

But a®+aa+6 = 0. 

Hence {y-{-a)a = 36 or a = Zbj{y-\-a). 

On substituting this value of a in the cubic we have 

y®-f-6a?/®+9a^y+4a®-|-276® = 0 . . (1) 

Now one root of this new cubic is y = (^— y)®. Hence, by 
symmetry, the other roots are (y— a)®, (a— Also, the 
product of these tliree roots is — (4a®4-276®), which is 
-108 J. 
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Thus (jS— y)2(y— a)®(a— yS)® = — 108 J . (2) 

This is an important result, for it expresses the 
discriminant A explicitly as a symmetric function of the 
roots. It verifies (i) that A vanishes when, and only 
when, two roots are equal, (ii) that A<0 wlxen all the 
roots are real and unequal, and (iii) that A>0 when two 
of the roots are complex conjugate, 

55. The General Cubic Equation. Let us now 
consider the general cubic equation 

afpo^-\-Za^x^-\-Za^x-^a^ = 0 , . . ( 1 ) 

where for convenience in what follows the binomial 
coefficients 1, 3, 3, 1 are inserted. If 2 = aocr+ai, then 

2;®+3(aoa2— af)z+«o“3~^®o®i®2+2ai = 0 • (2) 

as is easy to verify. It is usual to write 

H — G = 3aoaia2+2af . (3) 

If a, y are the roots of the cubic in x, then the roots 
for z are evidently 

apy+oq . . . (4) 

But aga+aj— (ao^+aj) = ao(a — j5), which is the difference 
of the roots for 2 . Also the preceding result 54 (2) applies 
to the equation for 2 , since this lacks a term in 2 ® ; and 
we shall have 

^ = —108 A, (5) 

where 

A = 62/4+^3^27 = (? 2 / 4 +27hf 3/27 = ;|((52+4£f3) _ (0^ 

Hence, in terms of the roots a, y of the general 
equation 

«oa;3+3aja;2+3a2a:+a3 = 0, 

we have 

«o(i3-y)='(y-a)2(a-yS)2 = -27(G2+4H3) = -108 A . (7) 
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This exhibits the discriminant J in a more general 
hght : it is called the discriminant also of the more general 
cubic, and the characteristic properties remain true of this 
cubic according as Zl> = <0. 

Since dH and G have taken the place of a and b in the 
original treatment, we can sum up as follows : 

The general cubic has three unequal roots if G +4ir®< 0, 
has one real and two complex roots if and 

two equal roots if G^-\-4:H^ = 0. 

It has three equal roots if G — S — 0, for in this case 
we find at once that = ct^ja^ = ftg/ag, and the cubic 
in X becomes = 0. 


56. The Canonical Form of a Cubic. The co- 
efficients of a cubic 

aQX^-\-Za^x^-\-Za^x-\-a^ . . (1) 

may be regarded as the first four coefficients of a recurring 
series 

S = . (2) 

where • • • (3) 

is the scale of relation (23), p. 53, and 

a^v^OjU—a^ = 0, arjV-\-a^u—a^ = 0 . ■ (4) 

are the initial equations which give u and v. Thus 

^ ^ . (5) 

® 0 ® 2 — ^ 

By the usual procedure (p. 54) the sum S is given as 

S = {aQ-\-{aj^—uaff)t}l{l—ut—vt^) . . (6) 


which breaks up into partial fractions 




g 


. ( 7 ) 
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when that is, when u^-\-4cV^0. Here a, ^ are 

given by the identity 

{l—ut—vt^) — (14-ai)(l+jSf), . , (8) 

so that a and /3 are the roots of the quadratic 

x'^-\-ux — V = 0, 

or {aQa^—a^x^-\-{afp,^—a^a^x-\-{a^a^—a^ = 0, . (9) 

which is called (Aitken, Determinants and Ilatrices, p. 130) 
the Hessian of the original cubic (1). The reader may 
verify that the Hessian has equal roots when A vanishes. 
The Hessian can be written also in the alternative forms 

a^x-^-a-^ %a:+«2 ®o % ^2 

= 0, a-^ a^ Ug = 0. (10) 

a^x-\-a 2 a^x-\-a^ 1 — x 

On expairding the series for (7) and comparing the result 
with the series 8, we have 

p-Vq = Oo, pa-\-q^ = — «!, = aj, pa^ + q^^ = — Os • (H) 

Multiply these by x^, —3a;®, 3a;, — 1 respectively and add : 
then 

p(a;— a)®+g(a:— jS)® ^ aQX^-\-Za^x'^+Za^x-\-a^. . (12) 

Since p and q are known by the method of partial 
fractions, we have thus expressed a cubic as the sum of two 
cubes of linear forms. 

This is called the canonical form of the cubic, and the 
reduction to such a form is possible, whenever the roots of 
the Hessian are distinct, that is, when A ^0. 

The result is due to Sylvester, who showed that the 
same method would apply to any form of an odd order, 
cubic, quintic, septimic and so on. For a quintic 

Q = affi6^-\-6axX^-\-\0a^'^-^\Qa^x^-\-5a^x-^a^ . (13) 

we take a recurring series 8 — as before, but with a 

scale of relation 

“3 = a^-\-ap}-\-aQW, a^ = a^u-\-a 2 V-\-a^w, etc., . (14) 

where {\—ut—vt^—wt^) = {l-}~at){l-\-^t){l-\-yt) . (16) 
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and 


8 


^ + 

1 1 ■ 


1 "Yyi 


- Z aj'^. 

n=0 


The quintic is readily found to be 
Q ~ p{x—a)^-rq(x — 

where a, j8, y are the roots of the cubic 
x^-{-ux^—vx-\-w — 0, 


or 


. (16) 


ctoX+Oi aj^cc+ag 

Uq 


CTg 

ag 

aiCC+Cfg 

= 0, or 

Ug 

Ug 

«4 

a^x-l-a^ agX-l-a^ a^x-{-a^ 

ag 

Ug 


«5 


1 - 

—X 

X^- 

—a;® 


= 0. (17) 


The determinant, of order n when the original form is of 
order 2n—\, is called the canonizant of the form. The 
method also applies when the roots of the canonizant are 
repeated ; we then modify the partial fractions in the usual 
way and proceed as before. 

Examples. 

1. If, for the cubic, a = j8. 


then 


S= ^ 4- ^ 

1 


and the canonical form is 

r{x—a)^+s{x—a)^, where r =p4-q,s = —Bqa. 

2. Reduce ot^—Bx+2 to canonical form 

(a = j8 = 1; (a;-l)3+3(a:-l)2). 

3. Reduce 6x^+18a;+7. 

4. If a, j3, y are the roots of a cubic and w is a complex 
cube root of unity, prove that the roots of the Hessian are 

j8y+ctiya+a>^aj8 jSy+to"ya+«ajS 
a“l-tuj3+to®y wy 
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THE BIQUADRATIC OR QUARTIC EQUATION 

57. The Quartic Equation : Feasibility of a 
Solution by Radicals. The equation 

f = 0 . (1) 

is called a biquadratic or quartic equation. In general it 
has four roots a, y, 8 which can be found algebraically 
by reducing the equation to a cubic equation in rational 
steps. Since it is known that equations of degree higher 
than the fourth cannot be so reduced to lower equations 
it is of interest to enquire why a quartic admits such a 
reduction. The underlying reason is simply this : that 
if a cubic equation 

= 0 . . . ( 2 ) 

is formed, the roots of which are 

^y+aS, ya+^8, a|0+y3, . . • (3) 

then each of its coefficients is a symmetric function of 
a, y, 8. (This is easily verified : for instance the sum 
of the three roots is Z)Sy.) Accordingly ,.;p, q, r are known 
rational functions of the original a^, a^, a^, a^, a^. But 
the cubic can be solved ; so that j8y-(-a8 is known. Also 
a^yS = a^/ao- Thus we can obtain ^y and aS as roots 
of a quadratic. Similarly for all six binary products ^y. 
Hence each ratio : ay and therefore : y is known, 
from which we caj^^d not merely the ratios of the 
roots but the roJ^B^selves by using the fact that 
2a = —iaJttQ. 

58. Ferrari 's Method of Solution. Let the quartic f 

130 • - 
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of 57 (1) be multiplied by % and then expressed as the 
difference of two squares 

(aoa;2+2aia;+ag+2oo^)2 — . . (1) 

where, it will be seen, the coefficients of and x^ agree 
with those of the original, multiplied by a^. Comparison 
of the further terms in x^, x^ and cc® gives us the relations : 

— af—a-Qa^+ap, M2I = a^a^~aQa^-\-2aQa^6, 

= {a2-^r2ao^V-a^a■^. . (2) 

Eliminating M and N by squaring MN and equating the 
result to we find that 

4a303-ffioJ0+J = 0, . . . (3) 

where 

I flo % «2 I 

I — ao®4— 4ai<i3+3a|, J = Oj lig (4) 

ag (Xg 

The equation (3), which is usually called Euler’s reducing 
cubic, is typical of the reduction ; for whatever algebraic 
method of solution for the quartic is attempted, sooner 
or later such a cubic resolvent is bound to arise. 

Let u, V, w be the roots of the cubic. From one such 
root w'we at once find the values of uniquely. 

Hence M and N are known, apart from a sign which is 
immaterial, since the result gives us simultaneously the 
quadratic factors 

of the quartic a^f. On solving these quadratics we obtain 
the four roots a, /8, y, 8 of the original quartic. 

Examples, (See also Example 5, p. 139.) 

1. Solve a:*+6.x3+8.r+21 =0. 

Here Oq = 1, «! = 0, Og = 21, I — 24, 

J = 16 and 0® — 60+4 = 0, whic^^Psplved by 0 = 2. The 
equation now becomes 

{x^ + l+4:f-{2MxfN)^ = 0, 
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giving M = -^1 and iV = ^2. Thus we have a;^d-2a;+3 
and a;® — 2 . 1 ; + 7 as factors. 

The roots are — 

2. Solve the equations 

(i) a:^— 4a:^+9a;+4 = 0, {2M — ~2N — 3) 

(ii) x*‘ + 6x^-\-14x^ + 15x-\-4: = 0. 

Obviously the quadratic equations obtained by 
factorizing the quartic in the form (1) above can themselves 
be factorized in terms of the roots of the quartic, so that 
we shall have 

a;^x^-\-2a^x-\-a^-\-2aQU-{-2Mx-\-N = aQ(x—a){x—^), ( 6 ) 

-\-2ayX-{-a^-\-2a^ — 2Mx—N = aQ{x—y){x—h), 

in order that the product of the two qiiadratics may be 
a^TI{x — a) or a^f. Since the four factors x — a may be 
paired in three ways, as {a^, yS), (ay, ^8), (aS, /Sy), we have 
another explanation of the existence of a cubic equation 
for 6. The triple pairings will correspond to the three 
possible values u, v, w of 6 when substituted in (6). 

By comparing coefficients of powers of x in (6) we have 

2ffli+2Jf = — ao(a+;S), a2+2aoW-fiV = a^a^, 

2a^—2M — — ao(y+S), az~\-2aQU—N =■ a^yZ, 

so that ao(aj8+yS) = 2a2-\-4aQU, 

Similarly, a(,(ay+iSS) = 2a2+4a(,w, 

a^laS-^-^y) = 2a2+4aQt^. . . , (7) 

Since 6^ is absent in the cubic u-\-v-{-w must vanish. 
Hence 2(aj3+yS) — (ay-j-^S) — (a8+j8y) = Sw — 4v — 4w—\2u, 
or 12it = (a— 8)(j8— y) + (a— y)(/3~S). . . (8) 

This gives a root u of the cubic explicitly in terms of those 
of the quartic. Analogous formulae hold for v and w, 
namely, 

12v= (a— ^)(y— S)+(a— S)(y— /?), 

12?^; = (a-^)(S-y)+(a-y)(S-i8). 
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By subtraction we find that 

4:{u—v) = (a—8)0—y) . . ■ (9) 

and two similar expressions for v—w, w—u. Hence, by 
multiplying all three results of this type together, we 
have 


&i{v—w){w—u){u—v) = (a— S)(,6— y)(a— j6)(y— S)(a— y)(§— jS) (10) 

When u — Q the four numbers a, j8, y, S are said to be 
harmonically separated, and the relation is often written as 


{a^, yS} 


(g— y)(j8— 8) 
(a— S)(^— y) 


( 11 ) 


In this case one root of the cubic vanishes : hence J = 0. 
Conversely, if J = 0 the four roots of the quartic form 
harmonic pairs. 

If a — then v — w, as is seen at once from (7). 
Hence both qv/xrtic and cubic have repealed roots. By 
51, p. 120, the condition for this is that 


J = J3-27J2 = 0. . . . (12) 

This expression A is called the discriminant of the quartic. 
It will now be proved that 


al{^-yf{y-ana-^f{a-8)\^-8)\y-8f = 256 Zl, (13) 


For by 54, p. 126, we have 

n(w-v)2 = 108(P/27-/2)/64a6 
= (/3— 27J2)/I6a6 
= 

Hence, by (10), 

ain(a~^)^ = al 642i7(w-v)a = 256 J. 

Example. For x* — 1 = 0 wo find I = — 1, J — 0, 
403 + 0 == 0, and the roots of the quartic are ±1, ±i, while 
those of the reducing cubic are 0, ±^ 1 - 

The reader can prove from (13) that A<0 is the 
condition for two real and two complex roots, Zl>0 is 
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that for four real or four complex roots. {Cf. Burnside 
and Panton, Theory of Equations (1899), vol. 1, p. 146.) 

59. Geometrical Aspect of the Quartic Equation. 

Let us consider the locus x = y = 2t. It is the para- 
metric form of a parabola 8 whose equation is = 4a;. If 

S' = ax^+2hxy-\-hy^-{-2gx-\-2fy-\-c — 0 • (1) 

denotes another conic, then the points common to the 
two curves 8 and S' are given by 

at*‘-\-2ht^-\-4J3t^-\-2gt^-\-4:ft-\-c = 0, . . (2) 

which is a quartic equation for t. It has in general four 
solutions a, jS, y, S, giving four points A = (a®, 2a), and 
so on, all of which lie on each conic 8 and S' . 

Now consider the conic 

F — afiX^-\-2ayXy+a^^+2a^+2a^y+a^ = 0. . (3) 

It meets the parabola where 

aQt*‘ -{-4:0 ^^ = 0. . . (4) 

Also the points A, B, C, D common to F and 8 lie on the 
conic 

G ^ F —aQd{y^—‘ix) — 0 . . (5) 

for all values of B. The discriminant of this conic vanishes if 
Uo % aa+2ao^ 

Oi a^—a^B ttg = 0, . . (6) 

aa+ScSo^ a^ 

as is at once apparent by writing out F in full and re- 
arranging the terms. On expansion this condition turns 
out to be the reducing cubic 

4ao®0®— aoI0+J = 0. . . • (7) 

The three roots of this equation give three values of B 
for which the conic O degenerates into two straight lines ; 
and, since the lines must pass through the four points 
A, B, 0, D, they can only be the pairs AB, CD ; AC, BD ; 
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AD, BC. In either case G resolves into factors linear in 
X and y, or quadratic factors in t. But when x = 
and y = 2t are substituted in the expression G the terms 
involving $ disappear, so that G takes the same form as 



F ; that is, G becomes the quartie in t. Hence we have 
factorized this quartie into two quadratic factors by- 
choosing d to satisfy the reducing cubic. 

60. Canonical Forms of Even Order. Suppose we 
have 

— 0 , a-^v-^-a^u—a^ = 0 , a^v-j-a^u—a^ = 0 . ( 1 ) 

Then the coefficients of the quartie 

aQX^-\-4ia\x^-\-Qa^x^-\-4:a^x+a^ . . ( 2 ) 

satisfy the recurrence relation (1), which is the recurrence 
relation used also in the. case of the cubic in 56. These three 
equations for u and v can be simultaneously true only if 

Uq % 

J = =0. (3) 

a«> art 
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In addition to all the results of 56 (11), p. 128, we now 
have = a^. Hence 

= pix—a)^-\-q{x~^)^. (4) 

This shows that when J vanishes, the quartio is expressible 
as the sum of two perfect fourth powers, and conversely. 

The determinant | aQa^a^...a 2 n \ of the 27i-ic form 

aoa;2”4-2%aia;2n-i^(2%)(2)<*22J^”“^+---+a2n . (6) 

is called the catalecticant of the form. In the writings of 
Cayley and Sylvester such a form was often denoted by 

((Xq) •••3 1)^” . , . (6) 

If the catalecticant vanishes the form, or quantic, is 
expressible as the sum of n perfect 2nth powers of linear 
forms X— -a, each with a constant coefficient, where a is 
a root of the equation 

I 2( — I — 0, . . (7) 

the last row in this determinant being understood to be 
1, —X, x^, ..., (— x)", the first row being a^, a^, ..., 

Next consider the quadratic 

F = {a+Xa')x'^-{-2{h-\-)(h')x-{-b-\-\b' . . (8) 

It is a perfect square when 

(ci-|-Ao!'^)(6-1-A6^) — {}i-\-Xh')“ = 0, . . (9) 

where (9) is a quadratic for A in terms of the other quantities. 
It follows in general that, for two values A^ and Ag of A, 
we can write the quadratic F in the form p{x~a)^. 
Accordingly, let 

■f=ax^-{-2'hx-\-b, f ~ a'x^+2h'x-\-b' , 
and /+Ai/' = p(x-a)2, f+XJ' = q{x~^)~. . (10) 

On solving these two relations for / and/' we obtain 
/= r(x--a)2+5(x-|8)A/' =r'(x-a)2+s'(x-/3)2, . (11) 

where r, s, r', s' are suitable constants. 
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Again, since a quartic can always be resolved into 
two quadratic factors / and/', in tliree ways, we can reduce 
such a pair of quadratic forms to the present form, and 
thus express their product, the quartic, as 

A{x—a)^+&H{x—a)^x—^)”+B{x—^)^, . (12) 

where A = rr', QH = rs'+r's, B = ss'. 

This is the usual canonical form for a general quartic 
— the sum of two fourth powers and the product of their 
squares, with constant coefficients. Evidently our earher 
case, where J was zero, corresponds to the possibility that 
the coefficient H may vanish. 

This canonical reduction of the quartic typifies also 
the reduction for the general 2?2.-ic ; namely, that the 
general 2%-io can be expressed as the sum of n perfect 
2nth. powers of linear forms, affected by constant coefficients, 
together with a term involving the square of the product 
of these forms. 

61. Gregory’s (1675) Method of Solving a Cubic 
or a Quartic Equation. Let us consider the equation 

aj^+goj+r = 0. 

Put X = 

so that v^+32u®+(3z^+5)v+z®+3'2+>’ — 0. . (1) 

Now multiply this cubic by the arbitrary cubic 

v^-\-av^-\-hv-\-c . . • (2) 

and in the resulting sextic for v equate to zero the co- 
efficients of v^, v^, V. 

Then 3z+® = 0, 3z®-l-g'4-32a+6 = 0, 

az^~\-aqz-\-ar~\-Sbz^-\-bq-\-Szc — 0, 

bz^-{-bqr-\-br-\-^z^c-\-qc = 0. (3) 

Solve these equations for a, b, c in succession. 

Then a = — Zz, b = 62 ^ — q, and c = (5^-j-32r — 152*)/32, 
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Substitute in the final equation and we find that 

27z^—27z^r—q^ = 0 . . . ( 4 ) 

This is a quadratic for z^, whence z is found, and, with it, 
a, b and c. The sextic for v is of the form v^-\-Av^-\-B = 0, 
which can now be solved. Finally, x == z-\-v. 

Next consider the quartic x^-\-qx'^-\-rx-\-s = 0, with 
X = z-\-v, so that 

= 0 . ( 6 ) 

Multiply this by the arbitrary quadratic 
choosing a and b such that the resulting sextic in v may 
have no term in y® or y® or y. 

Then 4z4-« = Oj 42®+29'34-^+62%+g'a+462 = 0, 

a(2^+g'z2+r2+s)+6(42®+2^2+r) = 0. . (6) 

Hence a = — 4z, b = (20z®+2g'2— y)/4z 

and 642®4-32g'z^+(4g2— 16s)z2— = 0. . . (7) 

Solve this cubic for z^, whence z, a, b are found. Next 
solve the sextic for y, which is of the form 

v^-\-Av*‘-{-Bv^^C = 0, 

and so is a cubic in y^. Finally, x = z-\-v. 

Next, the quintic equation, x^-^qx^-\-rx^-\-sx-\-t = 0. 
Gregory put x = z-\-v as before, and multiplied the quintic 
in y by an arbitrary form y^®+ay^*+... +& of degree 15. 
He equated every coefficient in the resulting 20-ic to 
zero, except those belonging to y^®, y^®, y^°, y®, y®. This 
gave him sixteen equations for the fifteen unknown co- 
efficients a, b, ..., k and for z. If they could be solved, 
then y® would depend on a quartic and the value of x 
would be found. 

Death intervened (in 1675) before he had completed 
the full investigation. Actually the eliminations would 
have led to a sextic equation for a power of z, but the 
matter was not cleared up until the time of Abel early 
in the nineteenth century. So sure was Gregory that he 
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had discovered the general algebraic method of solving 
all equations that he was in correspondence with his 
friends Collins and Dary, arranging for the calculation of 
the eliminations for all equations up to the tenth degree ! 

Abel proved that the resolvent of any equation of 
degree w>4 was an equation of degree higher than n ; 
and this epoch-making result established the impossibility 
of solving general algebraic equations of the fifth or higher 
degree by rational processes and explicit root extractions 
in a finite number of steps. 


Examples. 

1. Solve (i) £c*-f 2a^— 14a;®4- 8 = 0. 

(ii) a:4-l=0. 

2. If a+jS — y+S for the general quartic, prove that 

O'^cfs— + = 0- 

Solve x*—8x^ + 8x^ + 32x — 44 = 0. 


3. Solve 


X a b c 

ax. . 

X 


■ 0 , 


X 1 . . 

1 2a3 1 . 

1 2aj 1 


= 0 . 


X 


1 2x 


The latter determinant is cos id if x = cos 6 ; hence 
Trr 

X = COS — , r = 1, 3, 5, 7. Generalize. 

8 

4. Express the discriminant of a cubic or a biquadratic 
as a determinant in the (p. 80). 

5. The following alternative form of Ferrari ’s method is 
useful in solving numerical examples. 

Solve ci;^-|-4ic^ + 8.r- + 7a: + 4 == 0. 

Let this equation be written 

= (ax+b)^. 

Then, on comparing coeihcients, 

2t—a^ == 4, 4:t—2ab = 7, t^—b^ = 4. 

Whence 4(2i ~-4)(i2--4) = 4:aW- =: (4^~7)2, 

so that Si53 ~ 32^2 + 24i + 1 5 = 0. 

A rational factor of this cubic is 2t — 5, yielding t == 2^, 
^ and leading to rational quadratic factors 

{x^-\-x + l){x^-\-3x + 4:) of the quartic. 
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ELIMINATION OF THE UNKNOWN FROM 
CONSISTENT EQUATIONS 

62. Dialytic Elimiaation. From n separate equations 
it is generally possible to eliminate n—\ unknowns. The 
following, which is called by Sylvester the dialytic method, 
provides a systematic means of eliminating one unknown 
from two equations. From the reverse point of view 
the vanishing of the eliminant may be regarded as the 
condition .under which the two equations in question 
possess a common root. 

Consider the case 

ax^-{-bx^-\-cx-\-d = 0, px^-{-qx-\-r = 0. . (1) 

Multiply the cubic by x, and the quadratic by x and also 
by a: so that five relations are obtained : 

ax^-\-bx^-\-cx^-{-dx = 0, 

ax^-\-bx^-{-cx-\-d = 0, 

px^-{-qx-\-r = 0, . . (2) 

px^-\-qx^-\-rx = 0, 

px^-}-qx^-{-rx^ — 0. 

If we treat these as five linear equations, homogeneous 
in x^, x^, X, 1, the condition for their consistency is, 
by the theory of linear equations (Aitken, Deterniimnts 
and Matrices, p. 64), 

abed. 

.abed 

151 = I . . ^ f I = 0. . . (3) 

. pq r 
\p q r . 
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Since the term involving a® in the expanded form of this 
determinant |i?| is easily seen to be —ah-^, the expansion 
does not vanish identically. But it is free from x : hence 
it is the required eliminant, or res^iltant. 

Similar methods apply to a pair of equations of degrees 
m and n : corresponding to the m-ic there are n rows in the 
determinant, corresponding to the n-ia there are m rows. 
In all there are m+w rows. This type of determinant 
is usually called (Aitken, Determinants and Matrices, p. 125) 
a bigradient. 

From three equations, consistent in two unknowns x and 
y, we could eliminate x from the first and second, and again 
from the first and third equation, giving two resultants 
containing y, from which y could be eliminated by a like 
procedure. 

63. Factorized Form of the Resultant. Suppose 
that we have 

ax^-{-bx^-{-cx-\-d = a{x—a){x — jS)(a:— y) =f{x), 
px^-\-qx-{-r = p{x—X)ix—ij,) — (f>{x). 

Then, when both f{x) and ^(cc) vanish simultaneously, one 
of A, /X must equal one of a, j8 or y ; and so the following 
relation must exist : 

ii2'l = (A— a)(A— ^)(A— y)(/x— a)(/x— j8)(^— y) = 0. (2) 

This can be written in terms of alternants (p. 47) as 

2l(Aju.a;8y)/Zl(A/.c)2l(ajSy) = 0, . . . (3) 

or again, since /(A) = a(A— a)(A— j8){A— y), and so on, 

p^a^B'l = 2>®/(A)/(/x) = a2^(a)^(/3)^(y) = 0. . (4) 

We have evidently hit upon several alternative forms of 
resultant for / and <f>, and the question arises, how are these 
related to the bigradient [i? j ? The answer is as follows ; — 
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By row-into- column multiplication we have 

abed. fjL^ A/a ju/y^ 

.abed ys A3 ^3 

. .pqr 

. p q r . ^ y X fJL a^a V^y 

pqr. . 1111 a^a P^<i>^ y^y ( 5 ) 

where fx=f{X), <^a=f^(a), etc. On taking determinants wc 
have 

\li\\R'\A{aPy)A{Xfx) — —fxf^j,a<j>^j>yA{aPy)A{Xij). ( 6 ) 
But |i2 [ = jxf = 4^a.4‘p4*ylP^- 

Hence [i?[|i2'| = — a^p'^\R'\^, or |i?| = — a^p^lR'l. . (7) 

The alternative forms of the resultant jRj — 0, ji2'j = 0 
are therefore clearly equivalent, as was pointed out by 
Professor E. T. Whittaker (Proe. Edin. Math. Soc., Series i, 
vol. 40 (1922), pp. 62-63) who gave a proof substantiallj’- 
the same as the above. 

64. Bezout’s Condensed. Eliminant. In 1779 Bezout 
gave a method which produces an %-rowed determinant 
as the eliminant of two equations of degree n. For 
example, consider the case w = 3 for two cubics 

f{x) = a^x^+a^x'^+a^x+a^ = 0, 

^{x) = bQX^-\-b-iX^+b2X-\-b^ = 0. 

Multiply these two equations successively by 

6o and ag respectively, 

6o£r+6i and ag^+aj, 
bQX^-\-bjX-\-bz and aga;®+%^+ 02 ) 

and subtract each time the products so formed. Then the 
results are the three following equations : 

+ |«0^2l^ + 

iao 62 .r 2 -f- (|ao63|-l-|fli62l)‘'« + = 0, 

Og^g X^ 4- -j- |a 26 ; = 0. 


( 2 ) 
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where \afp^\ represents the determinant , and so on. 

By eliminating from these three equations a; as distinct 
variables, we obtain the resultant as a three-rowed 
determinant equated to zero, thus, 

l^o^al 

K^2l l«0^3lH-l«l62l l«1^3 =0. (3) 

i^o^si 

This determinant is called Bezout’s ehminant, sometimes 
the Bezoiitian or Bezoutiant. The case for a general value 
of n is constructed in similar fashion. 


65. Relation between the Dialytic Eliminant and 
B6zout’s Eliminant. Taking first for illustration the 
case m ~ n = 3, let us consider the bigradient resultant 
matrix 


( Xq ( X ' j ^ ^^2 Ctg • • 

I 6^0 ^'2 ^3 * 

M — ‘ • «0 ®3 (1) 

. . ^2 ^3 

. 6o 62 ^3 

^0 ^2 ^3 • 

Premultiply i? (in ordinary inatrical, or row- into- column, 
multiplication) by a suitably partitioned matrix 


. 1 . . 

. ~bo Oq . 

* ^0 ^0 

60 ^2 ^2 




( 2 ) 
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The result is 


Uq Ct^ Clg Ctg 

a, ag 

KR= 

\aj) 

a^p 

The determittant |E^| of K is evidently a®, and the deter- 
minant \KR\ is likewise evidently multiplied by the 
three -rowed determinant of the elements in the lower 
right-hand corner. Hence, by taking determinants of 
both sides and cancelling a^, we have 

Cipl\ |®0^2l |<^0^3 

\R\= ap^\ \aps\M^p2\ l«i^3 (4) 

^O^sl |^2^3 


I I M • (3) 

!«0&2| I^O^sI 

** 0 ^ 3 ! “b 1^1^21 l^^si 


which shows that the bigradient and the Bezoutian 
eliminants are identical in value. 

The elements of the Bezoutian contain minors of the 
2nd order and of weight (in the {i, j)th element) z’+j — 1, 
these minors being taken from the rectangular two -rowed 
matrix 

■»o «2 ••• ««] in\ 

h h h h 

Oq Oj Og ... 

this last matrix being most obviously connected with the 
identity 


r -4 

^ ' 

0 

■ 

1 

. . . 



'o' 

rH 

0 

— 1 

, , , 


,0, 


where a is a common root of the associated equations which 
give rise to the eliminant. 
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66. Case of Quantics of Unequal Degrees. As for 
the case when m^n, it is easily derivable. In the trans- 
formation KR of 65, put = 0, and then alter [6^ 63] 

to [63 61 63]. This gives 


*^2 • 

. art a, a, a, 

1-5 . . &o 61 “ 62 

. 60 61 62 • 

^0 ^2 • 


60 61 62 

ao&3+l«AI |%&2 
<^0^2 l%^2l !<^2^2 


( 1 ) 


where the elements in the second and third rows are to he 
regarded as minors of the array or matrix 


<2o % <^2 Cl^ 

• 60 


(2) 


If we introduce a factor and write the top row of 
the eliminant as [a^pQ, the result is more 

symmetrical, while it still gives a genuine eliminant, since 

ao 7 ^ 0 . 

For the general case, where m—n = r, we take the 
fmidamental array to be 

[ tto a^ ag a^ -^r+i ••• I /q\ 

60 

by putting the first r coefficients hj in 65 (5) equal to 
isero and renaming the suffixes of the remaining so that 
they become b-^, 6,^, 


Examples. !• Obtain the dialytic and the Bezoutian 
form of the eliminant of (i) two quadratics, (ii) two quartics, 
(hi) a quadratic and a quartic. 

2. If/(ir) and g{x) are polynomials, prove that 

{f{^)9{y) — / (y)g{x))l{^ -y) 


is also a polynomial. 

3. Write out this derived polynomial of Ex. 2 in powers 
of y, when/(ir) is a cubic and g{x) a quadratic. By equating 
coefficients of powers of y to zero, and then eliminating 
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powers of x in the results, obtain the Bezoutian eliminant 
of/(aj) andgr(.r). 

4. Derive on the model of Ex. 3 an alternative method 
for obtaining the Bezoutian eliminant of two equations. 

67. Elimination and the G.G.M. Pi-ocess. Still 
another way of obtaining the resultant |i?] of two poly- 
nomials /(«) and <l>{x) is to perform the Euclidean G.C.M. 
process upon them. For example, if f{x) and j){x) are the 
cubics given in the equations 64 (1), the process begins 
with dividing (^{x) by f{x). To avoid fractions, multiply 
<^{x) by the non- zero constant : then the first quotient 
is 6o the remainder a quadratic whose leading term 
is \a^-j\x'^. Before the next division multiply the dividend 
by lao^il, again to avoid fractions, and continue in the 
same way. If / and have no common factor involving 
X the remainder will be a constant |i2|, a polynomial 
expression in the coefficients % and 6,-. As in 17, p. 38, 
we shall have an identity 

Af+B^^\R\. ( 1 ) 

Now if the equations 64 (1) are simultaneously true 
when a: = a, then f = = 0 a,t this value of x and hence 

|J?| = 0. But |E| is independent of x: hence jit;| = 0 
for all values of x and is in fact the same resultant as 
before. Conversely, if 1F| does not vanish it is impossible 
for / and ^ to have a common factor. 

This method gives the necessary and sufficient condition 
for / and ^ to have not only one root in common but also 
k roots in common. For if the intermediate remainder 
which is of degree k in x does not vanish identically while 
the next, which would usually be of degree Jc — 1, does 
vanish identically, then the G.C.M. is of degree Ic, and 
exactly k roots are common to / and <f). The condition is 
therefore given by equating eacli of the k coefficients of 
powers of x in this remainder to zero. 

It is interesting to know that all these remainders can 
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be given in dialytic form through a formula due to Cayley 
(1848). Por two cubics, 64 (3), it is 

% £*2 ag . x^f{x) 

tto Oi a, Uq zf(x) 

|ii| = I . Z a: al >(4 I . . ( 2 ) 

• 6o ^2 

b-^ b^ &3 X(f){x) 

6i 62 ^3 • x^{x) 

Here the sixth column can at once be reduced to 

{ • . • * } 

by subtracting col^, and so on : hence \Il\ is the dialytic 
eliminant of Sylvester. Delete row^, toWq, col^, colg from 
|jB| and the resulting four-rowed minor gives the pen- 
ultimate remainder \S\. Delete row^, row 45 col^, C 0 I 3 of 
|;S| and the first remainder is obtained. For a full 
discussion see Muir, History of Determinants (1920), voL 3, 
pp. 329-349, which comments on Trudi’s paper of 1862. 


Miscbulakbous Examples 


1. Between what values, must the quantity m lie if the 
roots of the equation 

a;^4-2a;(m+5) + 2m^-l-ll?n+23 = 0 
are to be real ? 

2. Draw the graphs of and of 8 — and hence find 
approximately the real root of the equation x^-{-x^ — 8=:0. 

3. Find the equation whose roots are the squares of 
those of x^+x+l=0. 

4. If f{x) is a polynomial of order n in x, prove /(a:)/ (—a;) 
is one of order n in 

5. If y=x^, and f{x)f{—x) = what are the roots of 

= 6 ? [The squares of those of x, 

6. Solve 27aj^— 45a;^ + 2Sir— 4 = 0, which has a repeated 

root. pS 2 —2■iz^y7 

Ls’ 3’ 3 ' 

k2 
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7. Solve a:®— 6a:^+9a; = 3 by taking x = 2+2 cos 6. Prove 
that one root is 4 cos® 10® and find the other roots. 

8. The sextic equation, x^-\-ax?-\-bx-\-c = 0, has three 
roots equal to a. Prove that c = —5a®. Find also a and b 
in terms of a, and the cubic equation which gives the other 
three roots. 

9. If a is a root of a;® = 2, show that 1+a+a® is a root 
of 2 /®— 3y®— Sy— 1 = 0, and solve the latter equation., 

10. Determine the nature of the roots of 

(i) 3.a;®+a:®-lla;+6= 0, 

(ii) 2a:®+3a: = 4, 

and evaluate their real roots to three significant figures. 

11. Prove that there cannot be two differing identities 

S = <t>{e), S — ipie) expressing S, a given polynomial sym- 
metric function of the roots x^, x ^ as a polynomial 
in their elementary symmetric functions ~ Exfij, 

By subtraction this would imply a nonzero polynomial F{6) 
in the e’s, which would vanish identically wlien expressed in 
terms of the x's. Let T = Xej^e 2 <ie/ ... be any nonzero term 
of F(e) ; that is A + 0. Expres.sed in terms of .tj’s let each 
term T be arranged in descending lexical order ith regard 
to Xj^. x^, ... , x„. Then T will necessaiily contain a leading 
termZ' == Ax^-^x^^Xg-^ ... , where P ~ p-\-q-\-r-\- ... ,Q = 

... R = r-\- ... , etc. 

Since p, q, r, ... are fixed uniquely if P, Q, R, ... are 
given, no two terms T can have the same leading term. Put 
the leading terms X, one from each T. also into descending 
lexical order. Since they all differ they too have a single 
leading term. This term therefore stands foremost of all 
the terms in x among all the terms T. Hence it is unique 
and cannot possibly cancel out from the assumed identity ; 
which is absurd. Thus no such identity exists, and the 
reduction of p. 75 is unique. 



CHAPTEE SSI 

FURTHER LIMITING AND APPROXIMATE PROCESSES 

68. Explicit Formula for the Roots of Eqiiations. 

Consider the polynomial F of the %th order, as given on 
p. 71, where 

F{x)={l—ax){l—^x)...{l ~Xx) = 1 — + . . . + ( — (1 ) 

and lfF{x) — hQ-\-hjX-{-h^x^ 

Multiply this last relation throughout by and use 
(1) p. 47. Then we find that the left-hand side expression 
yields 

1 1 

a jS 

(1— {l—^x)~^. 

while is now the coefficient of a;”+ on the right. Expand 
each of the n elements in the bottom row of this numerator 
determinant in ascending powers of x (as in (2) p. 70). 
Since x^ occurs in the lowest row only of this determinant 
and then along with a** etc., the coefficient of x^ on the 
right has for numerator the %-rowed determinant 

1 1 . 

a ^ . 

= A{0,l,...,n~2,p) (3) 

a’i-2 

a*’ . 

say, while the denominator is a particular case of this 
with p =71—1. For lower positive integral and zero 
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values of this alternant A vanishes identically, having 
two equal rows. Tor higher values it duly gives the h 
expressions in bialternant form 

h, = ^(0, 1, 2, «-2, n+r-l)jA{0, 1, 2, n-1) (4) 


Example. 

For a cubic, 7% 


1 

1 

1 


1 

1 

1 

a 

|8 

y 


a 


y 

a’ 


y’ 




y^ 


.4(017) 

' 4(012)’ 


Now suppose that among the n real or complex roots 
a, jS, ... the modulus |a| of a exceeds that of each of the 
rest. Then it follows that the ratio tends to a 

itself as r tends to infinity. For since |a|>|^|, therefore 
)^/al^->0. Hence jS’’ /a'^-^0 also, and likewise for each 
further root compared with a. On dividing each element 
of the bottom row in the numerator of both and 
by and then proceeding to the limit, the result follows. 
Thus, with n — 3, 

4(0, 1, r+3)/4(012) 

“4(0, l,f+2)/4(012) 

a3H(01) _ ^ 

'a2H(0lj 

where H(01) denotes the two-rowed alternant of j8, y. 
Similarly for n in general. 

The same propert}’’ holds in the confluent case provided 
that a is a repeated root whose modulus again exceeds 
that of each of the rest. Suitable confluent determinants 
now appear in (2) as they did on p. 48. Thus if a is re- 
peated exactly h times, the first k columns in both these 
determinants belong to a. The preceding argument for 
the limit then clears the bottom row of each determinant 
as before, except in these first k columns. Further division 
of these bottom rows by before taking the limit, 


1 

1 

1 


1 

1 

1 

a 

(3 

y 


a 

iS 

y 


0 

0 


o 

0 

0 



LIMITING PROCESSES 


151 


leads to zeros in all but the column of each determinant, 
and to the result a as before. Thus, with % == 3 and just 
two equal roots. 


hf 


1 

a 


1 

• 

1 


a 

1 

y 


a^'+s 

(?*+3)a^+^ 

yT+Z 





( r + 2 ) a ’'+2 yr+2 


• ( 6 ) 


This has proved the following theorem : 

Theorem. When the modulus of one root, repeated or 
unrepeated, exceeds that of each of the remaining roots, then 
the ratio hp+i/hj tends to this root for its limit as r tends to 
infinity. 

Let us call this the regular case, and denote all others, 
where the greatest modulus belongs to two or more distinct 
roots, as irregular. Evidently these last include the case 
when the greatest modulus belongs to a complex root 
(and therefore to its differing conjugate complex root) in 
a real equation. Two theorems for the regular case can 
now be proved. The first is a simplified form of one by 
Daniel Bernoulli (1728) : the second, which marks an 
epoch in the long history of equations by giving an explicit 
formula for a root, was discovered by E. T. Whittaker in 
1918 {Proc. Edin. 3£ath. Soc. 1, 36 (1918) p. 103). 

Corollary. By an algebraical, followed by an arithmetical, 
division the root with greatest modulus, in the regular case, 
can be found to any degree of accuracy. 

Proof. Divide mrity by F{x), using algebraical long 
division, and so obtain consecutive coefficients h^, ... 

in the quotient. (As in Horner’s method there is no need 
to wite down the letter x, only the numerical values of 
the terms.) At any suitable stage divide h^+x by hj., and 
the resulting quotient is an approximation to such a root, 
by the preceding theorem. 
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Examples. 

1. If F{x) — + the h series is 1, 9, 91, 

898, 8893, 88016, ... by long division. Also 88016-^-8893 = 
9*897..., within *001 of a. Horner’s method gives 9*898 .... 

2. Prove that the theorem holds even when any finite 
polynomial replaces unity in the numerator. What happens 
if this numerator contains a factor in common with the 
denominator ? 


Whittaker's Theorem. If the equation 

0 = 00— eiX+egX^-^-egX^+e^x^— ... , (e^ = 1) 

has a root a~'^ tvhich is smallest in absolute value, then 



02 O3 




' 



[®1 ®2 


Gi e-s 

Co ei 

I ®0 


Oq ^2 



. Cq 




( 7 ) 


provided that the series converges. 

Proof. Let this series be written in the notation 

- + • + + +... ( 8 ) 

e, e, 

where the rule of suffix formation is obvious, and the 
connexion between multiple suffix and the corresponding 
determinant is defined by the relation 

^2? ^<7+1 ^r-h2 * 

^r+1- , ... . (9) 


The principal diagonal e,, e^, ... of the determinant is 
at once fixed by the multiple suffix in . . . Each column 
is characterized by descending and consecutive suffixes. 
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At present we shall only need the case when aU suffixes 
p, q, r are equal within a multiple suf&x. As in the first 
column of above, negative sufl&xes cannot occur : 
instead, zero elements appear. 

By a theorem on dual bialternants (Aitken, Determin- 
ants and Matrices, p. 117) we have the identity 

^7rp ••• 

where {pqr and {jrp ...} are conjugate partitions (p. 2 ) 
of the same positive integer, and A,rp • • • is constructed 
analogously to the determinant (9). Simple instances of 
this have occurred at (2) p. 71, which yield = h^, e^ = 

^2 = ^3 = The graph, as on p. 2 , shows that 

622 = ^22 ^222 — ^ 33 - translating from e to h the 

first r terms of the sei-ies ( 8 ) now become 

^ _1_ ^11 ^22 I I r-l 

■*" ' ' ■ h^hs hr 

^ 

h V Us V Wr K-J 



But this tends to a“^ as r tends to infinity, by the Theorem 
of p. 151, and so proves the theorem. 

Corollary. If a finite number of hj vanish but all hp+j 
(i> 0 ) are non-zero, after a suitably chosen value p, then 

1 ^P-l I I /n\ 

- = — . . (J.1) 

CL hp bphp^_j 

This follows by the same methods of proof. We may 
note that the theorem and corollary cover the regular 
case. For in this case the limit of hr-ijhr exists so that 
only a finite number of terms h^ could vanish. The modi- 
fied series ( 11 ) is automatically convergent. 
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69. The Irregular Case. In 1924 Aitken * extended 
this result to give the elementary symmetric functions of 
a subclass ... of the roots, such that each modulus 

|a|, exceeds each modulus of an omitted root. 

This covers the irregular case wherein several distinct 
roots in the subclass have the same maximum modulus. 
Not merely the elementary but all the usual symmetric 
functions may be derived by such series. They depend 
on the ratio hplhq where P and Q are suitably chosen, 
equally numerous, multiple suffixes. 

To illustrate this let but |a| = |iS|>|y|, where 

y denotes any other root. This case of two such equal 
moduli needs the double suffix function 


h 


rr 


hy. 


and the identity = ^rr r-i ^r+i> r+i> 

which arises at once by appl 3 mg Jacobi’s theorem of the 
adjugate (cf. Aitken, Determinants, p. 98) to the cofactors of 
the four corner-elements in the three-rowed determinant 

^rrr' 

In fact we have 




h 


r-(-lJ r +1 


^0 

hii 

h 


+ 




hr 


r +1 


_j_ ^111 _l_ 

^11 ^11 ^22 ^rr ^r+lJ r-l -1 


^r-1 

h 



by repeated use of Jacobi’s theorem. Also by a Lemma 
which follows below, the left-hand side expression tends to 
1/ajS as r -> 00 , while the right yields the infinite series in 
A or e at pleasure. Thus, in the present example, 

1 =!«+ + + (12) 

ap ^2 ^22 " ^ 

* A, C. Aitken, Proc, Royal Soc. Edin., 45 (1925), p, 14; 46 
(1926), p. 289. 
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From the limit of we may siinilarly deduce that 


1 _l_ 1 I ^2 ^in I 


; !i -f- 

62 ®22 


+... (13) 


^222 


From (12) and (13) the values of a and ^ can now be found. 

Lemma. The limit of = l/aj8 as r -> 00 (14) 

Proof. We use Jacobi’s fundamental identity for 
bialternants, 

Ka-r = S'+l^ ...,r+n—l)IA{0, 1, 1) . (15) 


Cf. Aitken, Determinants, p. 116. For brevity take % = 4, 
and four distinct roots a, j8, y, 8. Then 


hr 


h 


r+1 


1111 

a ^ y S 

Q^r+2 ^r+2 yr+2 Sr+2 
^r+3 jgr+a yr+3 3 ’‘+3 


1111 

a jS y 8 

^>•+3 ^r+3 yr+3 ^r+S 
Q^r+4 |gr+4 yr+i 3*'+^ 


Expand each determinant by a Laplace development of the 
top 2 rows with the bottom two rows. Then divide both 
expansions throughout by a’jS’’. Since [(y or 8) -h(a or ^)y 
will occur in every term of each expansion, except the first 
term, the first alone of each will not tend to zero as r-»oo. 
Hence the quotient tends to 


1111 

a jS y 8 

a2 |S2 . . 

a3 /33 . . 


1111 

a (8 y 8 

a3 ^3 _ . 

. . 


which reduces to 1/ajS. Similarly for (13). The confluent 
case presents no difficulty if taken as in (6). 

Examples. 


1. Provo that a- 


^1 ^1^11 hAn 


. . . for the 
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2. Prove that a”^ = -f2 -j- j 122 ^ 

^11 ^11^111 ^111^1111 

3. If a:” ... , prove that 

^n+2?-l ^ ^ 

[Hints: 1* Use 2. ^r/^r-i- 2 l 

70 . Other Approximate Methods for Solving 
Equations- For further details on numerical solutions 
of equations the reader should consult Whittaker and 
Robinson’s Calculus of Observations (1926), pp. 78-132. 
But two more of the early methods may be mentioned 
here for their simplicity and interest. 

Anderson's Method.^ To solve f{x) =0 having 
found a fair approximation x ~ a, put x = «(l+2/)/(l— 2/) 
so that y is necessarily small. Obtain the corresponding 
equation for y but neglect third and higher powers of y, 
say p+qy+ry'^ == 0. Then y = —piq nearly, so that 
y = ^pj(^qJ^ry) = —pliq—prjq) more nearly. From this 
a greatly improved value of x is obtained. The method is 
powerful and applies to equations algebraical or tran- 
scendental, and can of course be iterated. Anderson 
claimed that it solves x^ = d more easily than = d. 

Example. Solve — 2x == 5. Here ir = 2 is a good start : 
therefore put x === 2{l ^y) : whence 
1 = 432/ + 132/2 + 92/3. 

Thus 2/ = -4 ^ approximation, and next y = 1/(43 + 132/) 

= 43/(432 + 13) = 43/18G2. This gives a: -= 3810/1819 == 
2-094557..., which is within -00001 of the true result 
2-09455148.... 

Maclaurin ’s Method. After taking an approximate 
value a, put x == a-\-y, so that y is small. Proceed as 
before but retain powers of y beyond the second if neces- 

* G, Anderson (1739), Letter to Jones, Rigaud’s Gorrespondence of 
Scientific Men (1841). 
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sary. Iterate. The process is more laborious than Ander- 
son’s to the same accuracy. 

Example. For a:® — 63a:— 50 = 0 take x = l+y 
where y is small. This gives 1 = 36 j/ — oxy — 
1/(36 — 12?/ +2/®)* Hence y = 1/36 nearly, and, better still, 

y — 1/^36— ^ = '02803. On putting y = • 02803 -fz 

and solving for z but neglecting k® and higher terms, the 
result is x = !• 02803923127 very nearly. 

71. Newton's Limits for the Roots. After stating 

in the Arithnietica U7iiversalis * the rules for the sums of 
powers of the roots (as quoted on p. 72), Newton gave a 
series of interesting theorems including the following ; 
(1) a the greatest root as r oo, (2) > s/ 

when r is odd, (3) ^r+i)+l'Sr] Newton 

confined these to equations all whose roots were real. 
These investigations so early in the development of the 
subject are interesting, particularly when they are com- 
pared with the results given above, using the h and e 
symmetric functions. The fimit of could equally 

well have been used (which would have led to a new but 
more complicated type of determinantal series on the 
basis of (7) p. 74), but, on the contrary, the root of 
hj. would not tend to a limit. 

Newton’s rule, given above on p. 97, follows in the 
Ariihmetica immediately after this discussion of s^. 

72. Newton’s Rule of Signs. To discover the com- 
plex roots Newton thereupon gave a generalization of 
Descartes’ rule of signs which runs as follows : from the 
equation 

f{x) — = 0 

form the 7-eal simple elements Oq, a^, stripped of 

I. Newton, Universal Aritlimciic, translated by Raphson, 
revised by Wilder, 1769. (Maclaurin’s method, p. 505.) 
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their binomial coefficients. Then form the quadratic 
elements, namely 

^0 == ao®2> ^2 = «2 ^— ■■■, = <^n- 

Let ^ I form sso. associated couple and — ^ -r^ r associated 

Ar\ ^ A.Ar^J 

couple of successions ®»-+i .4^.4^+!. Let p, P, v, V 

respectively denote a permanence or variation of sign in a 
succession of small or capital letters. There are evidently 
four possibilities in an associated couple, pP, vV, pV and 
vP. 

Newton’s Rule : — 

(1) The number of negative roots ^ SpP. 

(2) The number of positive roots ^ NvP. 

(3) The number of complex roots ^ NV. 

This rule was first proved by Sylvester, * nearly two 
centuries later, who incidentally pointed out that (1) follows 
at once from (2) on' changing x to — x in f{x), while (3) 
follows from (1) and .(2) directly since ZpP+NvP = 
NP = n — EV. Newton adapted the rule to the case 
when some of the vanished. The inequalities are neces- 
sary, “ but,” says Newton, “ you may know almost by 
this rule how many roots are impossible.” 

Examples. 

1. In a;® -f- l,2a5 -|- 9 = 0 the a and A series are 1, 1, 
4, 9 and 1, —3, 7, 81 respectively, with two variations V of 
sign in A. Hence at least two complex roots. 

2. In ‘ 2 ;® + 6a;® + 9a; — 16 = 0 the signs are 

a + + + - 
-4 + + + + 

giving SV = 0. But there are two complex roots. 

* J. J. Sylvester, Proc. London Math. Soc., 1 (1865-1866), p. 1. 
<7/. Collected Woi'ks, 2, p. 498. 
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